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1  Summary 

r 

During  the  period  1  July  1987  -  30  June  1988,  the  research  under  Grant  AFOSR-84-0181  has 
been  concerned  with  binary  parallel  optical  computing  architectures  with  particular  attention  to 
cellular  logic  and  symbolic  substitution  for  pattern  recognition  and  numerical  operations.  Our 
approach  has  been  to  experimentally  implement  binary  optical  cellular  logic  processors  and  inter¬ 
connection  arrays;  define  an  instruction  set  and  software  suited  to  optical  computing  systems;  and 
to  study  generalizations  of  optical  cellular  logic  processors  such  as  the  hypercube  and  pyramid. 
Recent  accomplishments  include  the  experimental  implementation  of  a  54-gate  binary  optical  cel¬ 
lular  logic  processor  with  instruction  decoders,  input/output,  memory  and  test /branch  functions: 
the  completion  of  a  binary  image  algebra  (BIA)  description  of  cellular  logic,  image  analysis  and 
symbolic  operations;  and  the  development  of  binary  image  algebra  algorithms  for  scale  and  shift 
invariant  pattern  recognition. 
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2  Research  Progress 


This  section  summarizes  research  progress  and  accomplishments  for  the  period  1  July  1987  -  30 
June  1988  on  Grant  AFOSR-84-0181  for  Nonlinear  Real-Time  Optical  Signal  Processing.  These 
results  are  discussed  separately  in  the  sections  that  follow. 


4 


An  Image  Algebra 

Representation  of  Parallel  Optical  Binary  Arithmetic 


K.  S.  Huang,  B.  K.  Jenkins,  A.  A.  Sawchuk 

Signal  and  Image  Processing  Institute 
Department  of  Electrical  Engineering 
University  of  Southern  California 
Los  Angeles,  CA  90089-0272 


“^Supported  by  the  Air  Force  Office  of  Sdeatific  Research  cadet  put  AFOSR-84-0181,  by  the  office  of  Nasal  Research 
under  contract  N 00014-8 6-K-0802,  and  by  an  IBM  pad  ante  fellowship. 

‘Portions  of  this  paper  presented  at  the  1 987  Annual  Meeting,  Optical  Society  of  America,  Rochester,  New  York,  October, 
1987  [20]. 


•Submit ted  to  Applied  Optics. 


Abstract 


Optical  computers  can  operate  on  2-D  planes  of  data  in  parallel.  Boolean  logic  equations  do  not 
provide  a  complete  description  of  such  parallel  operations  for  binary  arithmetic.  An  optical  system 
that  operates  on  planes  of  data  should  employ  an  inherently  parallel  mathematical  description  for  its 
arithmetic.  The  purposes  of  this  paper  are:  to  use  binary  image  algebra  to  develop  parallel  numerical 
computation  algorithms,  and  to  describe  the  execution  of  these  algorithms  on  a  digital  optical  cellular 
image  processor  (DOCIP)  architecture.  We  discuss  three  basic  binary  number  representations:  1)  binary 
row(or  column)-coding;  2)  binary  stack-coding;  and  3)  binary  symbol-coding  for  symbolic  substitution 
arithmetic. 
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1  Introduction 


Digital  optical  systems  hold  the  promise  of  providing  more  accuracy,  flexibility,  and  programmability 
than  analog  optical  systems,  at  the  cost  of  somewhat  lower  throughput  [1]  [2].  To  achieve  digital  optical 
computing,  there  are  at  least  three  possible  logic  systems:  residue  logic  [3]-[6],  multilevel  logic  [7]-[l0]  , 
and  binary  logic  [ll]  (12].  Because  it  is  much  easier  to  make  reliable  two  level  devices  for  binary  logic  and 
only  log-zk  of  them  are  needed  to  to  represent  k  levels,  in  this  paper  we  will  consider  only  binary  parallel 
optical  computing.  A  digital  optical  ceiluiar  image  processor  (DOCIP)  architecture  based  on  binary 
image  algebra  (BLA)  has  been  demonstrated  to  be  very  powerful  in  parallel  binary  image  processing 
[13]-[16].  This  paper  will  demonstrate  that  the  DOCIP  with  BLA.  algebraic  techniques  can  efficiently 
perform  parallel  numerical  computations  also. 

Boolean  logic  equations  for  binary  arithmetic  are  not  well-suited  to  highly  parallel  operations  on 
pianes  of  data;  they  do  not  reflect  the  location  of  data  except  typically  by  a  memory  address.  Here, 
we  first  seek  a  software  theory  for  parallel  numerical  computation  algorithms  that  simultaneously  have 
binary  digital  efficiency  and  the  advantages  of  optical  parallel  processing.  We  have  developed  a  binary 
image  algebra  (BIA)  [16],  built  from  only  3  fundamental  operations  and  5  elementary  images,  to  serve  as 
a  complete  unified  systematic  theory  for  binary  parallel  image  processing.  Now,  we  will  show  that  BLA. 
can  be  also  considered  as  a  spatial  logic  which  is  a  generalised  parallel  form  of  boolean  logic  with  an 
additional  parallel  information  transfer  ability.  BIA  then  becomes  a  formalism  and  a  general  technique 
for  developing  and  comparing  parallel  numerical  computation  algorithms  for  digital  optical  computers. 

Based  on  BIA,  parallel  numerical  computation  algorithms  for  the  DOCIP  machine  are  developed  and 
compared  to  that  far  optical  symbolic  substitution  processors  [17]-[19].  Symbolic  substitution  rules  are 
particular  BIA  image  transformations  [20]  (section  5).  Hence,  the  comparison  of  these  machines  will  be 
in  terms  of  this  BIA  algebraic  language.  Three  different  binary  number  representations  (binary  row(or 
column)-coding,  binary  stack-coding,  and  binary  symbol-coding  of  symbolic  substitution)  for  binary 
arithmetic  in  the  DOCIP  machine  are  developed.  Parallel  operations  of  binary  addition,  subtraction  and 
multiplication  are  derived  by  BIA  and  illustrated  as  examples.  Parallelism  is  achieved  by  performing 
arithmetic  operations  on  many  pain  of  operands  simultaneously.  The  carries  foT  each  pair  of  operands 
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are  essentially  propagated  serially  to  keep  hardware  complexity  low  [21].  This  enables  speed-ups  close  to. 
and  in  some  cases  equal  to,  linear  to  be  obtained.  In  this  paper  we  will  consider  only  positive  numbers. 
A  suitable  digital  number  representation  will  easily  provide  for  negative  numbers  also.  For  example, 
two’s  complement  arithmetic  can  be  performed  with  only  minor  modifications  to  the  algorithms  and 
programs  given  in  this  paper,  and  with  the  addition  of  one  more  bit  (the  sign  bit)  to  each  operand  and 
result. 

Section  2  gives  a  brief  review  of  BLA  and  the  DOCIP  architecture.  Section  3  presents  binary  rowfor 
column)-coded  arithmetic:  binary  addition,  binary  subtraction,  and  binary  multiplication  (including  a 
matrix-constant  multiplication  and  an  element-element  multiplication).  Section  4  presents  binary  stack- 
coded  arithmetic.  Section  5  gives  a  BIA  representation  of  symbolic  substitution  and  discusses  binary 
symbol-coded  arithmetic  (symbolic  substitution  arithmetic).  Section  6  gives  a  comparison  for  the  above 
different  number  representations. 

2  Binary  Image  Algebra  (BIA)  and  DOCIP  Architecture 

2.1  Review  of  Binary  Image  Algebra 

We  give  here  a  very  brief  summary  of  BIA.  Details  axe  contained  in  Ref.  [16]. 

A  binary  digital  image  is  usually  defined  as  a  function  /  mapping  each  grid  point  (x,y)  of  an 
orthogonal  coordinate  system  onto  the  set  composed  of  two  elements:  1  (white,  bright,  i.e.  image  point) 
and  0  (black,  dark,  i.e.  background  point).  However,  it  will  be  more  convenient  for  our  image  algebra  to 
use  only  the  set  of  coordinates  of  image  points  ('l’s)  to  specify  an  image.  In  BIA,  an  image  is  then  treated 
as  a  set  of  coordinates  of  image  points  (pixels  that  have  value  1).  This  paper  deals  with  only  binary 
arithmetic;  hence,  an  image  point  represents  a  binary  bit  with  value  1,  a  background  point  represents  a 
binary  bit  with  value  0,  and  an  image  is  a  finite  2-D  bit  plane.  We  list  here  only  those  basic  definitions 
and  operations  which  will  be  referred  to  latter. 

Definition  of  Binary  Image  Algebra  (BIA) 

Binary  image  algebra  is  an  algebra  with  an  image  space  5  and  a  family  F  of  operations 
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including  5  elementary  images  and  3  fundamental  operations.  Symbolically, 

3lA  =  (P(W);^,urJ,A,A~\B,B-1)  (1) 

where  S  =  P(W)  and  F  =  (0,U ,~,I,A,A~l,B,B~l).  The  image  space  S,  the  family  F  of 
operations,  and  all  other  symbols  are  defined  in  the  following. 

1.  The  Universal  Image  (the  bit  plane  containing  all  bits  with  value  1):  The  universal  image  is  a  set 
W  =  {(x,y)  |  x  €  Zn,y  €  Zn},  where  Zn  =  {0,±1,±2,  ...,±n)  and  n  is  a  positive  integer. 

2.  Image  Space  (the  set  of  all  possible  bit  planes):  The  image  space  is  the  power  set  (the  set  of  ail 
subsets)  of  the  universal  image,  i.e.  5  =  P(W). 

3.  Image  (bit  plane):  A  set  X  is  an  image  if  and  only  if  X  is  an  element  of  the  image  space  5,  i.e.  X 
is  a  subimage  of  the  universal  image  W. 

4.  Image  Point  (a  bit  with  value  1):  A  point  (bit)  (2,  y)  is  an  image  point  of  an  image  X  if  and  only 
if  (2,  y)  is  an  element  of  the  set  X. 

5.  Image  Transformation  (a  mapping  between  bit  planes):  An  image  transformation  T  is  a  function 
mapping  the  image  space  5  into  the  image  space  5. 

6.  Three  Fundamental  Operations  (Fig.  1): 

(a)  Complement  of  an  image  X:  • 

I={(*.y)l(*,j)€^A(i,j()^}  (2) 

(b)  Union  of  two  images  X  and  R: 

XuP  =  {(x,y)|(2,y)€Xv(x,y)€P}  (3) 

(c)  Dilation  of  two  images  X  and  R: 

{(*1 +*t,Pi +  »)  €  w  I  €  X,(2j,yj)  €  P}  (X  ji  *)  a  (R  ?k  #) 

X  ©  R  =  (4) 

otherwise 
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Remark:  “£”  denotes  “belongs  to”,  “A"  denotes  “and”,  *v"  denotes  “or”,  and  “<£”  is  the  null 
image  having  no  image  point.  Note  that  X  usually  represents  an  input  image  and  R  is  a  reference 
image  containing  predefined  information.  We  can  define  other  image  operations  as  fundamental 
operations  instead  of  these  three  operations.  The  reason  for  choosing  these  three  operations  is 
because  of  their  simplicity,  simple  software  design  and  simple  hardware  implementation.  Dilation 
can  be  interpreted  as  a  parallel  mathematical  formalism  of  the  pattern  substitution  step  in  symbolic 
substitution  (section  5). 

7.  Five  Elementary  Images:  There  are  5  elementary  images: 

(a)  I  =  {(0,0)}  —  consisting  of  an  image  point  at  the  origin 

(b)  A  =  {(1,0)}  —  consisting  of  an  image  point  right  of  the  origin 

(c)  A"1  =  {(-1,0)}  —  consisting  of  an  image  point  left  of  the  origin 

(d)  B  —  {(0, 1)}  —  consisting  of  an  image  point  above  the  origin 

(e)  B~l  =  {(0,  -1)}  —  consisting  of  an  image  point  below  the  origin 

In  fact,  these  5  elementary  images  could  be  reduced  to  4  elementary  images,  because  I  —  A©  A-1  = 
B  ©  B~l.  Any  (reference)  image  can  be  represented  as 

X=  U  A'B ’  (5) 

where  A’  B*  3  A*  ©\0J, 

A*  =  A  ©  A  ©  ...  ©  A  =  {(»\0)}  if  i  >  0, 

Ai  3  A-1  ©  A-1  ©  ...  ©  A~\  =  {(»,  0)}  if »  <  0, 

S  W  ■■■■!■■  |»IV  ••  ——  — I-' 

•I 

A0  3  A  ©  A-1  =  I. 

8.  Reflected  Imager.  Given  an  Image  R,  its  refected  image  is  defined  as 

A  *{(-*,-„)  |  (»,F)  6*}.  (6) 

9.  Some  Standard  Derived  Operations: 
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(a.)  Difference  of  X  by  R  (Fig.  2(a)): 


XI R  =  {(x,  y)  I  (*,»)  6  X  A  (x,y)  t  R}  =  X  n  7?  =  Xu  R  (7) 

Remark:  X  =  W/X  where  W  is  the  universal  image. 

(b)  Intersection  of  two  images  X  and  R  (Fig.  2(b)): 

Xn  R  =  {(x,  y)  |  {x,y)  6  X  A  (x,y)  £fl}  =  lu  f?  (3) 

Remark:  X  U  R  =  X  n  3?. 

(c)  Erosion  of  an  image  X  by  a  reference  image  R  (Fig.  2(c)): 

X  0  R  =  X  0  R  (9) 

where  R  is  defined  above. 

Remark:  X  ®  R  —  X  0  R.  The  erosion  of  an  image  X  by  a  reference  image  R  can  be  thought 
as  the  complement  of  the  dilation  of  the  background  by  the  reflection  of  the  reference  image 
R.  In  general,  the  erosion  of  a  non-null  image  X  by  a  non-null  reference  image  R  decreases 
the  size  of  regions,  increases  the  size  of  holes,  eliminates  regions,  and  breaks  bridges  in  X. 

(d)  Symmetric  difference  of  two  images  (Fig.  2(d)): 

XAR  =  (X/R)u(R/X)  =  XuRuffuX  (10) 

< 

Remark:  The  symmetric  difference  is  a  commutative  operation,  and  is  its  own  inverse. 

(e)  Hit  or  miss  transform  ®  of  an  image  X  by  an  image  pair  R  =  (Ri.Rj)  (Fig.  2(e)): 

X0R  =  (X9Ri)n(XeR,)  =  (Xe  Rx)  u  (X®  R,)  (11) 

Remark:  The  hit  or  «i««  transform  of  an  image  X  by  a  reference  image  pair  R  =  (Ri,  R?) 
formally  describes  the  pattern  recognition  step  in  symbolic  substitution  (section  5);  and  it  is 
used  to  match  the  shape  (or  template)  defined  by  the  reference  image  pair  R  where  Rt  defines 
the  foreground  of  the  shape  and  Rj  defines  the  background  of  the  shape.  The  conditions  are 
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that  the  foreground  X  must  match  R\  (i.e.  XQ  R\),  while  simultaneously  the  background  X 
matches  Rj  (i.e.  X 0  Ri)-  Note  the  similarity  of  the  symmetric  difference  (parallel  bit-wise 
comparison)  and  the  hit  or  miss  transform  (parallel  shape  or  symbol  recognition). 

The  important  results  of  BIA  are:  (1)  any  image  transformation  can  be  implemented  by  the  three 
fundamental  operations  with  appropriate  reference  images;  (2)  any  reference  image  can  be  generated 
from  the  elementary  images  by  using  the  three  fundamental  operations;  and  ^  3)  BIA  provides  an  efficient 
representation  for  many  parallel  image  processing  algorithms  (e.g.  shape  and  sine  verifications  [16]).  Here 
we  will  demonstrate  that  BIA  is  also  a  fundamental  tool  for  parallel  numerical  computation. 

2.2  Review  of  DOCIP  Architecture 

We  have  designed  a  class  of  the  digital  optical  cellular  image  processors  (DOCIPs)  for  effectively  imple¬ 
menting  BIA  [13]-[15].  Here  we  only  summarize  their  major  characteristics.  Details  are  given  in  [14]  [15]. 
To  map  BIA  into  the  DOCIP  architecture  in  a  transparent  way,  we  first  define  the  DOCIP  algebraicly: 

Definition  of  Cellular  Automata 

A  cellular  automaton  is  an  algebra  A  =  (5;  F,  Nc)  where  5  is  the  state  space  which  is  a 
set  of  states,  F  is  a  family  of  transition  functions,  and  Ne  is  the  neighborhood  configuration. 

Constraints  on  a  cellular  automaton  for  Implementing  BIA: 

1.S3  P(W) 

2.  F  D  {©,U,-} 

3.  Ne  J  I U  A  U  A~l  U  B  U  B~l  or  NeD  AU  A~l  uSu  B”1 
where  “D"  means  “contains”. 

Thus,  in  terms  of  cellular  automata,  the  DOCIPs  have  to  satisfy  the  above  constraints  for  realizing 
BIA.  For  storing  input  images  and  temporary  results  in  a  more  flexible  way,  the  DOCIPs  utilize  three 
memory  modules  and  all  share  the  same  algebraic  structure  (except  the  neighborhood  configuration): 

DOCIP  =  (P{W  x  Wx  W);9,'u,-,jVe)  (12) 
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where  “x”  denotes  cross  product  and  Ne  can  be  one  of  the  following  4  types: 

1.  DOCIP-array4:  each  cell  connects  with  its  four  nearest  neighbors  and  itself,  i.e. 

=  Iu  AD  A~l  U  B\J  B~l.  (13) 

2.  DOCIP-array8:  each  cell  connects  with  its  eight  nearest  neighbors  and  itself)  i.e. 

i 

iVami¥s  =  U  A'B>.  ■  (14) 

3.  DOCIP-hypercube4:  each  cell  connects  with  those  cells  in  the  4  directions  at  distances  1, 2, 4, 8, ...,  2k 
from  itself,  i.e. 

U  U’US1)  (15) 

« i,±l.±2.....±2* 

where  k  is  sufficiently  large  for  the  connections  to  traverse  the  entire  array  of  cells. 

4.  DOCIP-hypercube8:  each  cell  connects  with  those  cells  in  the  8  directions  at  distances  1, 2, 4, 8, ...,  2* 
from  itself,  i.e. 

U  (A*  UB'U  A'B*  U  A'B~')  (16) 

— «,±1,±2^2* 

From  the  above  algebraic  description,  the  DOCIPs  have  the  same  algebraic  structure  and  differ  only 
in  their  neighborhood  configurations  Nc.  Thus,  they  share  the  same  architecture  shown  in  Fig.  3,  but 
have  different  configurations  of  the  reference  images  Ei  depending  on  the  optical  interconnection  network 
which  defines  the  neighborhood.  In  practical  applications,  a  larger  reference  image  R  can  be  generated 
from  a  set  of  smaller  reference  image(s)  E%  by  a  “sequential  dilation” .  If  it  is  possible  to  decompose  R 
into  a  sequence  R  =*  Ei  ©  E7  ©  ...  8  Eh,  then 

X  ©  R  *  (...((X  ©  Ei)  ©  E7)  ©  ...  ©  ?*).  (17) 

This  decomposition  may  not  east,  in  which  case  R  can  always  be  decomposed  as  R  =  fii  u  R7  u ...  u  A*, 
and  then 

X  ©  R  =  ( X  ©  Ai)  U  (X  ©  R7)  U  ...  U  {X  ©  Rk)  (18) 

where  each  R}  can  be  decomposed  into  smaller  reference  images  Ei  [14]  [22]. 
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Basically,  the  proposed  DOCIP  shown  in  Fig.  3  is  a  cellular  SIMD  machine  and  consists  of  an  array 
of  ceiis  or  processing  elements  (PEs)  under  the  supervision  of  a  control  unit.  The  control  unit  includes 
a  clock,  a  program  counter,  a  test  and  branch  module  for  feedback  control,  and  an  instruction  decoder 
for  storing  instructions  and  decoding  them  to  supervise  cells.  The  array  of  cells  includes  a  1  x  3  line 
destination  selector,  where  each  line  is  iVJ  bits  wide,  three  N  x  N  x  1  bit  memories  for  storing  images, 
a  memory  selector,  and  a  dilation  unit.  It  operates  as  follows:  (1)  a  binary  image  (N  x  N  matrix)  is 
input  into  the  destination  selector  and  then  stored  in  any  memory  (or  set  of  memories)  as  the  instruction 
specifies;  (2)  after  one  to  three  images  have  been  stored,  these  images  and  their  complements  are  piped 
into  the  next  stage,  which  forms  the  union  of  any  combination  of  images  (specified  by  the  instruction); 
(3)  the  result  is  sent  to  a  dilation  unit  where  the  reference  image  specified  by  the  instruction  is  used  to 
control  the  type  of  dilation;  (4)  finally,  the  dilated  image  can  be  output,  tested  for  program  control,  or 
fed  back  to  step  (1)  as  the  instruction  specifies. 

The  DOCIP  machine  (Fig.  3)  has  one  instruction;  it  implements  the  three  fundamental  operations  of 
BIA  along  with  fetch  and  store  [22].  This  design  uses  the  parallelism  of  optics  to  simultaneously  execute 
instructions  involving  all  N2  picture  elements.  Each  instruction  takes  one  complete  cycle  to  execute. 
Note  that  the  DOCIP  machine  can  perform  a  dilation  by  any  reference  image  R  that  is  a  subset  of  the 
neighborhood  configuration,  Ne,  in  a  single  clock  cycle. 

The  entire  system  can  be  realized  by  an  optical  gate  array  with  optical  3-D  interconnections  [11] 
[12]  [23].  Fig.  4  describes  an  optical  implementation  concept  for  the  DOCIP-hypercube.  The  DOCIP 
has  very  low  cell  hardware  complexity  to  maximize  parallelism,  yet  enough  cell  sophistication  to  permit 
the  machine  to  execute  useful  programs.  The  use  of  optical  interconnections  permits  a  cellular  hyper¬ 
cube  topology  to  be  implemented  without  paying  a  large  penalty  in  chip  area  (the  cellular  hypercube 
interconnections  are  space-invariant  which  implies  relatively  low  hologram  complexity);  it  also  enables 
images  to  be  input  to  and  output  from  the  machine  in  parallel. 
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3  Binary  row(or  column)-coded  Arithmetic 

Binary  addition  of  two  Ar-bit  numbers  yields  at  most  k  -f  1  bits,  and  binary  multiplication  of  two  Ar-bit 
numbers  yieids  at  most  2k  bits.  In  this  paper,  we  assume  that  all  input  numbers  axe  padded  with  enough 
zeroes  to  avoid  the  possibility  of  overflow.  This  also  guarantees  that  the  different  operands  in  the  image 
will  be  treated  separately.  A  binary  row-coded  number  is  encoded  in  a  part  of  a  row  of  an  image. 
Although  the  word  lengths  of  numbers  do  not  need  to  be  equal,  we  assume  in  this  discussion  that  an 
image  (bit  plane)  with  N  x  N  bits  contains  N2/k  numbers  of  fc-bit  length  as  a  simple  illustration  (Fig. 
5).  In  this  section,  we  describe  parallel  addition,  subtraction  and  multiplication  by  BIA  expressions  and 
their  programs  on  the  DOCIP  machine. 


3.1  Addition  of  Binary  row-coded  Numbers 

Consider  an  image  X  (e.g.  Fig.  6(a))  composed  of  N2/k  numbers  x„ »  =  1, 2, ...,  N2/k,  an  image  R  (e.g. 
Fig.  6(b))  composed  of  N2/k  numbers  r;,i  =  1, 2,...,jV2/Jfc,  and  the  output  of  the  addition  5  =  X  +  R 
(Fig.  6(c)).  To  realize  this  addition  in  parallel  by  means  of  BIA,  we  first  consider  the  serial  (carray- 
propagate)  addition  of  2  binary  numbers  s,-  =  x<  +  r{.  The  first  step  of  serial  addition  is  to  add  the  least 
significant  bits,  say  x,(a)  and  r,(0).  The  boolean  logic  equations  for  adding  the  two  least  significant  bits 
(half-adder)  are 


•  sum  bit:  st(o)  =  xi(o)  XOR  r^, 

•  cany  bit:  Ci(o)  =  x^o)  AND  iy#). 

Now,  applying  the  corresponding  parallel  operations  of  XOR  and  AND,  i.e.  the  symmetrical  difference  A 
and  intersection  n,  and  lifting  the  set  of  carry  bits  by  a  dilation  ©,  we  can  implement  parallel  addition 
by  the  following  recursive  equations: 


1.  Define  the  initial  states  of  images  of  sum  bits  and  carry  bits  (called  sum-bit  image  and  carry-bit 
image)  at  time  t,  as  : 


5(to)  =  X,  C(to)  =  R. 


(19) 
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2.  The  recursive  relation  between  the  states  of  the  sum-bit  image  and  carry-bit  imag?  at  two  adjacent 
time  intervals  is  then: 


5(ti+1)  =  S(U)  A  C(U)  =  S{U)  u  CM  u  S{U)  u  C(tj)  (20) 

C(U+i)  =  (5(ti)  nC(ti))©  A’1  =  S(U)uC(U)®A-1  (21) 

where  i  -  0, 1,2,...,  A  +  l,  and  the  elementary  image  A-1  is  used  to  shift  the  carry-bit  image  one 
bit  to  the  left  for  the  next  iteration. 

3.  After  a  maximum  of  k  +  1  iterations,  the  sum-bit  image  is  the  result  and  the  carry-bit  image  is 
the  null  image  <t>-. 

S(tk+l)  =  X  +  R,  (22) 

This  procedure  is  illustrated  in  Fig.  6(d).  The  result  of  parallel  addition  of  binary  numbers  with  a 

maximum  Ar-bit  word  size  is  obtained  after  k  +  1  iterations.  This  algorithm  can  be  implemented  in  the 

DOCIP  architecture  by  the  program  (instructions)  given  below.  Mi,  Mj,  and  M3  represent  the  three 
N  x  IV-bit  memories.  “X  -*  Mi”  denotes  “store  X  into  memory  Mi”.  Each  numbered  line  represents 
a  single  DOCIP  machine  instruction  for  one  value  of  i.  Comments  are  in  parentheses. 

•  Assume  start  with  X  in  Mi  (=  S(to))  and  R  in  Mi  (=  C(to))- 

•  First  to  k*  iterations: 

1.  STuS7-m3  (»5(touC(^)) 

2.  J?Tu  Ma  -  Ml  (a  s(u)  U  C(t<)) 

3.  STuSJuBJ—  M3  (=  S(ti) u £(t0) 

4.  u  M3  -» Mi  (=  S(t*+. 1)) 

3.  S3©  A-1  — *  M3  (=  C(t»+i)) 
where  i  =*  0, 1, 2, ...,  k  -  1. 


•  (k  +  1)**  iteration: 

1.  337 U  Mj  —  M3  (=  S(tfc)  u  C(tfc)) 
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2.  Ml  U  377  -  Mi  ('=  5(tfc)  u  C(tk)) 

3  iV/i  u  .V/3  — *  Out  (=  S(f*+ 1)  =  A"  +  fi). 

Tie  total  number  of  clock  cycles  for  the  execution  of  this  program  on  the  DOC  IP  machine  is 

t(k)  <  5*  +  3  =  0(k) 

which  is  independent  of  the  number  of  words  being  added. 

In  fact,  BIA  can  be  used  to  devise  a  parallel  form  of  a  conditionai-sum  adder  or  carry-lookahead  adder 
for  further  extracting  additional  parallelism,  and  the  execution  time  of  this  addition  can  be  reduced  to 
0(logik).  Obviously,  there  exists  a  tradeoff  between  execution  time  and  hardware  complexity.  This 
paper  concentrates  only  on  some  simple  algorithms  . 

3.2  Subtraction  of  Binary  row-coded  Numbers 

Let  the  output  of  the  parallei  subtraction  be  D  »  X  -  R  (e.g.  Fig.  7(a)-(c)).  To  realize  it,  we  hist 
consider  the  serial  binary  subtraction  of  2  binary  numbers  <U  =  x,  -  r<.  The  procedure  in  the  least 
significant  bits  x ,(„)  and  r,(0)  of  binary  subtraction  generates  a  difference  bit  d^0)  and  a  borrow  bit  b^0y 
The  boolean  logic  equations  for  subtracting  the  two  least  significant  bits  (half-subtractor)  are 

•  difference  bit:  sq,)  =  xq„)  XOR  r^y 

•  borrow  bit:  =  Tq„)  AND  r^y 

Now,  applying  the  corresponding  parallel  operations,  and  shifting  the  set  of  borrow  bits  by  a  dilation*  ©, 
we  can  implement  the  parallel  subtraction  as  follows: 

1.  Define  the  initial  states  of  images  of  difference  bits  and  borrow  bits  (called  difference-bit  image 
and  borrow- bit  image)  at  time  t,  as  : 

D(to)»X,  ff(to)  =  *.  (23) 

2.  The  recursive  relation  between  the  states  of  the  difference-bit  image  and  borrow-bit  image  at  two 
adjacent  time  intervals  is: 

D(ft+t)  =  D{ti)  A  B{U)  =  D{ti)  U  D(U)  u  B(U)  (24) 
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B{U+i)  =  {D(U)  n  B(ti))  0  A-1  *  D(r,)  u  B(U)  ©  A"1 


(25) 


where  i  =  0, 1,2,  ...,k  + 1,  and  the  elementary  image  A'1  is  used  to  shift  the  borrow-bit  image  one 
bit  to  the  left  for  the  next  iteration. 

3.  After  a  maximum  of  k  + 1  iterations,  the  difference-bit  image  is  the  result  and  the  borrow-bit  image 
becomes  the  null  image  <j>: 

=  X  -  R,  5(tfc+i)  =  4>.  (26) 

This  procedure  is  illustrated  in  Fig.  7(d).  The  result  of  parallel  subtraction  of  binary  numbers  with  a 
maximum  fc-bit  word  size  is  obtained  after  k  +■  1  iterations.  The  DOCIP  architecture  can  realize  this  by 
the  following  program  (instructions): 

•  Assume  start  with  X  in  Afi  (=  D(t0))  and  R  in  (=  B(t0)). 

•  First  to  fc4*  iterations: 

1.  Mt  U  —  Af3  (=  D(U)  U  lift)) 

2.  HTTu  M7  —  Ml  (=  D(ii)  U  B{u)) 

3.  37Tu  (=  Dfc+i)) 

4.  (S?3  0  A*1  —  M2  (=  ff(ti+i)) 
where  »  =  0, 1, 2, ...» Jfe  —  1. 

•  (*  +  1)A  iteration: 

1.  A/i  u  S7  -*  AT3  (=  D(tk)  U  BCuJ) 

2.  HTTu  Af2  -*  Mi  (=  5f?J)u  B(tk)) 

3.  U ^7 (=  D(tk+l)  =  X-R) 

The  total  number  of  dock  cydes  in  the  DOCIP  to  complete  this  subtraction  process  is 

<*)<4*  + 3  =  <?(*). 
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3.3  Multiplication  of  Binary  row-coded  Numbers 

Using  the  representation  illustrated  in  Fig.  5,  we  define  a  parallel  (matrix-constant)  multiplication  of 
an  image  set  of  binary  numbers  and  one  single  binary  number  X  ■  Rr,  and  parallel  (element-element) 
multiplication  of  two  image  sets  of  binary  numbers  X  x  R. 


I.  Matrix-Constant  Multiplication  X  ■  Rr 

Consider  an  image  X  (e.g.  Fig.  8(a))  comprising  X3/k  numbers  x,,t  =  1,2 ,...,N3[k,  and  a  reference 
image  Rr  (e.g.  Fig.  8(b))  comprising  only  one  single  fc-bit  binary  number  r  =  (r(*_1)r(jk_2)...r(0))2. 
The  output  of  the  parallel  multiplication  is  X  •  Rr  (Fig.  8(c)).  To  realize  it,  we  first  consider  the  serial 
multiplication  of  two  binary  numbers  that  is  the  sum  of  the  shifted  versions  of  the  multiplier  or  the 
multiplicand.  Then,  by  applying  the  corresponding  parallel  operations  and  parallel  shifting  by  a  dilation 
0,  we  can  implement  this  parallel  multiplication  by  the  equation 

X  Rr=  £  X  0  A"'  (27) 

where  the  sum  notation  £  refers  to  a  sequence  of  parallel  additions  and  the  parallel  addition  +  is  defined 
in  subsection  3.1. 

The  DOCIP  takes  0(k2)  clock  cycles  for  implementing  this  matrix-constant  multiplication.  Its 
procedure  involves: 

1.  Generating  the  term  A~*: 


•  The  DOCIP -array  requires  at  most  l  <  fc  —  1  =  O(fc)  clock  cycles,  because 

A’1  =  (A~lY 


s  A~l  0  A~l  0  ...  0  A~l 
N  11  ■»  / 
l 

X  0  A~l  =  (.~{{X  0  A~l)  0  A~l)  0  ...  0  A’1). 

l 


(23) 
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•  The  DOCIP -hypercube  requires  at  most  log^l  <  lo<ft(k  -  1)  =  0{logik)  clock  cycles,  because 
we  can  rewrite  l  as  a  binary  number  l  =  (a(l/on<J)-"ari)a(0))3’  and  we  have 

a-1  = 

s  A*°w  ©  A_a<1)-21  ©  ...  ©  A'^'-w'Jr21'**^  (29) 

X  ©  A~‘  =  (...((X®  A“*W)®  A’*™'21)®  ...© 

where  [lo#!]  is  the  greatest  integer  less  than  or  equal  to  logil,  and  each  dilation  with  A-’or2' 
can  be  implemented  in  the  DOCIP-hypercube  in  one  single  clock  cycle. 

•  The  total  time  delay  for  generating  all  required  X  ®  A~l,  0  <  l  <  k  -  1,  is  bounded  by  0(k) 
for  both  the  DOCIP -array  and  the  DOCEP-hypercube.  Since 

X  ®  A~l  =  (if  ©  A“^-1))  ©  A-1,  (30) 

we  can  generate  the  new  term  X  ©A_/  by  simply  deriving  it  from  the  previous  term  X©A-^-1) 
without  starting  from  the  original  X.  The  total  generating  time  is  then  dominated  by  the 
number  of  terms  X  ©  A~l  which  is  at  most  0(k). 

2.  Implementing  the  summation  £/,vr(l)»i  X  ©  A~‘: 

•  The  DOCIPs  require  at  most  k  -  1  =  0{k)  parallel  additions  to  implement  this  summation, 
and  each  parallel  addition  requires  at  most  i+l  =  0(k )  iterations  (as  shown  in  subsection 
3.1).  Since  it  takes  O(k)  time  for  generating  all  the  terms  X  ©  A"1,  the  total  execution  time 
of  the  DOCIPs  for  this  matrix-constant  multiplication  of  Ar-bit  binary  numbers  is 

0(k)  x  O(Jfe)  +  O(Jfc)  =  0(Jfe3). 

From  the  example  shown  in  Fig.  8,  Rr  *  I  U  A-1  contains  only  a  single  number  r  -  (0101)}  =  5,  and 
the  DOCIP  can  implement  this  matrix-constant  multiplication  X  ■  Rr  as  follows: 

Assume  start  with  X  in  M\  (®  X  ©  I). 

1.  Afi  ©  A"3  —  AT,  (=  X  ©  A"3) 

2.  The  instructions  of  the  parallel  addition  are  performed  as  shown  in  subsection  3.1: 

Mi  +  Mi  -  Out  (=  X  ■  Rr). 
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II.  Element-Clement  Multiplication  X  x  R 


Consider  an  image  X  (e.g.  Fig.  9(a))  comprising  N7/k  numbers  x,-,»  =  1,2,  ...,N2/k,  and  an  image 
R  (e.g.  Fig.  9(b))  comprising  N2/k  numbers  r,-,i  =  1,2, ...,  N2/k.  Tbe  output  of  tbe  element -element 
parallel  multiplication  is  X  x  R  (Fig.  9(c)).  Because  tbe  multiplication  of  two  binary  numbers  is  tbe 
sum  of  the  shifted  versions  of  tbe  multiplier  or  tbe  multiplicand,  applying  tbe  corresponding  parallel 
operations,  we  can  implement  this  parallel  multiplication  by  tbe  equation 


XxR  =  L'!-Z(X®A-‘)n((Rn(MQA-l))(BUkrl-'A->) 
=  E*Jo  X  ©  a-1  u  TL  U  M  ©  A-1  ©  U kjZo~lA~j 


where  tbe  mask  M  (Fig.  9(d))  is  used  to  extract  tbe  1th  bit  (where  tbe  O'*  bit  is  least  significant  and 
tbe  (k  -  1)**  bit  is  most  significant).  Tbe  DOCIPs  can  implement  this  element-element  multiplication 
by  tbe  procedure 


1.  Generate  X  ©  A~l  and  5u  M  ©  A~l: 


•  Using  an  argument  similar  to  that  in  subsubsection  I  above,  tbe  DOCIP-array  takes  O(k) 
time  and  tbe  DOCIP-hypercube  takes  0(lo<frk)  time. 

2.  Generate  K  u  Af©  ©  U^£~lA~*: 

•  Tbe  DOCIP -array  takes  0(k )  time,  because 

*  U  1 A-*  *  (  U  A~’)k~{~1  3  ( U  A-’)  ©  ( (J  A~’)  ©...©(  U  A->),  (32) 

jmO  jm  0  jmC  jmO  jmO 

k-l-l 

l  >  0,  and  each  dilation  by  a  term  in  parentheses  executes  in  one  clock  cycle. 

•  Tbe  DOCIP-hypercube  takes  0{logik)  time,  since 

k-l-l  n 

U  A-i-  n  (U  A~*wv)  (33) 

jmO  hO  jm 0 

where  k—l—l  =  (a(jj«<l(k-<-i)j)...a(i)a{o))ti  and  again  each  dilation  by  tbe  term  in  parentheses 
executes  in  one  clock  cycle. 
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•  It  takes  0(k)  time  for  the  DOCIP -array  and  O(log^k)  for  the  DOCIP-hypercube  to  generate 
the  term  ( X  ©  A-')  U  ((3?  u  (M  ©  A"1 ))  ©  u^^A'-7). 

3.  Implementing  the  summation  X  ©  A~l  ©  u*~q-1  A~;: 

•  The  summation  requires  at  most  (k  -  I)  addition  operations,  and  each  addition  operation 
takes  O(k)  time  on  the  DOCIP  system.  We  also  require  O(k)  time  for  the  DOCIP-array  and 
0(logik)  time  for  the  DOCIP-hypercube  to  generate  each  operand  of  the  addition.  Thus,  for 
this  element-element  multiplication  of  4-bit  binary  numbers,  the  total  computation  time  is 
0{k3)  for  the  DOCIP-array  and  0{k2lo<ftk)  for  the  DOCIP-hypercube. 

Multiplication  requires  more  than  three  memories.  This  can  be  accommodated  by  either  building 
more  memory  into  the  DOCIP  machine  or  by  swapping  intermediate  results  into  and  out  of  an  external 
memory.  In  the  latter  case  we  assume  the  external  memory  can  be  loaded  and  unloaded  with  one  image 
in  a  single  time  step.  In  section  4,  binary  stack-coded  arithmetic  also  requires  more  than  three  memories; 
we’ll  make  the  same  assumptions  on  the  use  of  an  external  memory. 

For  binary  column-coded  arithmetic,  a  number  is  encoded  in  a  part  of  a  column  of  an  image  as  in 
Fig-  10.  All  the  algorithms  derived  in  this  section  can  be  also  applied  to  binary  column-coded  numbers 
except  that  we  replace  the  elementary  image  A*1  by  a  different  elementary  image  B  for  shifting  the 
carry-bit  image  or  borrow-bit  image  in  the  vertical  direction. 

« 

4  Binary  stack-coded  Arithmetic 

In  this  case,  a  number  is  encoded  in  a  stack  of  k  image  planes  with  the  least  significant  bit  in  the  first 
plane,  next  least  significant  bit  in  the  second  plane,  etc.  (Fig.  11).  We  assume  all  numbers  including 
the  results  of  arithmetic  operations  can  be  represented  in  k  bits,  so  that  k  images,  each  with  IV  x  N  bits, 
contain  N2  binary  numbers.  Here,  we  describe  parallel  addition,  subtraction  and  multiplication  by  BLA 
expressions. 
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4.1  Addition  of  Binary  stack-coded  Numbers 

Using  the  representation  illustrated  in  Fig.  11,  we  consider  the  parallel  addition  of  two  sequences  of 
images  of  binary  numbers.  Assume  a  sequence  of  images  X  =  (X(*_t),  X(t_j),  ...,X(o))  (e.g.  Fig.  12(a)) 
storing  N 2  binary  numbers  r„i  =  1,2, ...,1V2,  and  a  sequence  of  images  R  =  (R^k~i),R{k~ 

(e.g.  Fig.  12(b))  storing  N2  numbers  r,-,  i  =  1,2,...,  IV2.  Then  the  output  of  the  parallel  addition  is 
X  +  R  =  5  =  (5(jt),  ...,5( Q))  as  shown  in  Fig.  12(c).  To  realize  this  addition  using  our  three 

fundamental  operations,  we  implement  an  array  of  full  adders  as  described  by  the  equations 


1.  The  least  significant  bit  planes  of  sum  bits  and  carry  bits  are  given  by: 


X(o)  A  12(o)  =  ^  ^  -8(0)  u  X(0)  u  12(o) 

(34) 

C(i)  =  X(0)  H  12(o)  =  -^(o)  u  -fyo) 

(35) 

2.  The  recursive  relations: 


S(i)  =  X(,-)  A  A  C(,-) 

*  (X(,-)  n  Rf,-)  n  £(q)  u  (X^  n  12^  n  C(,-))  u  (Xw  n  R^  n  C^)  u  (X(i)  n  ^  n  £(,)) 


C(*-i)  *  (^(i)n  •*(*))  u(^(0nC'(i))u  (R{i)  nCM) 

*  (^(0  u  -fy*)) u  (^(0  u  c(o) u  u  c(0) 

♦ 

where  *  «  0, 1, 2,  .1.,  k  -  1. 


(36) 

(37) 


3.  The  final  solution  is: 

X  +  R  =  5  =  (S(*),S(fc_i),...,5(0)).  (38) 

where  5(*j  a  C(*)  because  X(*)  = 

This  algorithm  can  be  implemented  in  the  DOCIP  architecture  by  the  program  (DOC1P  instructions): 
•  Assume  start  with  X(0)  stored  in  M\  and  12(0)  stored  in  Afj. 
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•  Calculate  and  C^y. 

1.  iETT u  Mi  —  M3  &  Out  (=  C(„) 

2.  37Tu  M2  —  Mi  (=T^U  Ji(0)) 

3.  A/;  U  Mj  U  -V/3  —  Mj  (s  Jf(0)  U  A(o)) 

4.  377 U  377  —  Out  (=  S(0)) 

•  Calculate  5^j  and  C(2): 

1-  Af(!)  —  Ml 

2.  377  U  M3  — *  Afj  (=  X(\)  U  C(i)) 

3.  Mx  U  A/3  — *  M\  {—  -X(i)  U  C(d) 

4.  Mi  u  Mi  — *  Mi  (=  A  C(i)) 

5.  Ryi)  — *  Mi 

6.  Mi  U  Mi  — < »  Mi 

7.  Mi  U  Mj  — *  Mi 

8.  Mi  U  M3  — *  Out  (= 

9-  A'd)  —  Mi 

10.  J2(1)  — *  Mi 

11.  M\  u  Mi  — »  M3 

12.  C'd)  -*  Mi 

13.  Mi  U  Mi  — *  Mi 

14.  Mi  u  Mi  —•  Mi 

15.  A{i)  Mi 

16.  Mi  U  Mi  — *  Mi 

17.  Mi  U  M3  — *  M3  Sc  Out  (*  C(2)) 

•  Calculate  5(3)  to  £(*-1)  and  to  C(*j: 

Use  the  same  instructions  for  calculating  5( i)  and  C(3)  except  that  -*€«)  and  J^i)  (and  5(i)  and 
C(v)  are  replaced  by  X(i)  and  %  (and  5(i)  and  C{<+d)  in  each  iteration,  and  in  the  beginning  of 
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an  iteration  the  memory  -W3  stores  instead  of  £(1),  «  =  2,3, 

The  complete  execution  of  this  operation  in  the  DOCIP  requires 

t(k)  <  17 (k  -  1)  +  4  *  17*  -  13  =  O(k). 

clock  cycles.  Additional  parallelism  could  be  extracted  to  further  reduce  the  execution  time  by  utilizing 
carry-lookahead  techniques  or  by  optimizing  the  above  program. 


4.2  Subtraction  of  Binary  stack-coded  Numbers 

Let  the  result  of  the  parallel  subtraction  be  X  -  R  =  D  =  (D(k_x),D( jt_2)>  — >-D(o))  (e.g.  Fig.  12(d)). 
To  realize  it  using  the  3  fundamental  operations,  we  consider  a  serial  full-subtractor.  Applying  the 
corresponding  parallel  operations,  we  can  implement  this  parallel  subtraction  by  the  equations 

1.  The  least  significant  bit  planes  of  difference  bits  and  borrow  bits: 

.D(o)  =  X(o)  A  Ji(0)  =  *(o)  U  u  i?(0)  u  X(0)  (39) 

5(1)  =  X(o>  n  ^<o)  =  -T(0)  u  JT(07  (40) 


2-  The  recursive  relations: 

£(,-)  =  (-^(o  n  ^(0  n  5(<))  u  n  n  B^)  u  ( X n  n  u  (X^  n  n 

=  C*(i)  <J  Jifi)  U  5(0)  U  (A(,-)  U  5^ U  B(i))  U  (^(j)  U  U  U  (X(,-)  U  U  4{i)) 

(41) 

5(i+i)  =  (^(«) n  ^(0  n  5(^)  u  (Jt(,-)  n  n  u  (X^  n  n  B^)  u  ( X n  n  j 3(i)) 

*  (XW  U  U  £(,-)')  U  (X(<)  u  Sj,"  u  B(i))  U  (Xtf  u  R(i )  U  U  (X(,-)  u  R^i)  u  3{i)) 

(42) 

where  »  =  0, 1, 2, ...,  *  -  1. 


3.  The  final  solution: 

X  -  R  =  D  =  (£(*_!),  i?(k_3),...,I>(0))-  C43) 


This  algorithm  can  be  implemented  in  the  DOCIP  architecture  by  the  program  (instructions): 


•  Assume  start  with  X(0)  in  Mi  and  Jt/0)  in  Mj. 

•  Calculate  £>(0)  and  5(1): 

1.  Mi  U  —  M3  k  Out  (=  E(i)) 

2.  MTu  Mj-*M,  (=X^u5(0,) 

3.  M7  U  M3  —  Out  (=  D^0)) 

•  Calculate  D(1)  and  B^y 

1.  X(I)  -  Mi 

2.  Mi  u  M3  — »  M^ 

3.  M\  u  M3  — »  Mi 

4.  R(\)  —  M3 

5.  Mj  U  M3  — »  M2 

6.  Mj  U  M3  — »  M3 

T.  Mj  u  333  -*•  Ml 

8.  iZ^x)  -*  M3 

9.  Mi  U  MJ  —  Mi 

10.  MTuM3  —  Mj 

11.  AT(x)  -►  M2 

12.  Mi  u  M3  — »  M3 

13.  5(i)  - *  Mi 

14.  Mi  u  M3  -  M3 

15.  Ma  u  MJ  -  Out  (=  Z?(1)) 

16.  ^"(i)  — *  M3 

ir.  37T  u  M3  —  Mi 

18.  Ji(i)  -►  M3 

19.  Mi  u  M3  — ►  Mi 

20.  MTu  M2  —  M3  k  Out  (=  5(2)) 
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•  Calculate  to  D[k~  1)  wtd  -5(3)  to  ^{k)- 

Use  tiie  same  instructions  for  calculating  and  Bp)  except  that  X^  and  5(i)  (and  and 
5(2))  are  replaced  by  *<0  and  5(,-)  (and  and  5(i+j))  in  each  iteration,  and  in  the  beginning 
of  an  iteration  the  memory  M3  stores  5(,-)  instead  of  i  =  2,3,..,*. 

Therefore,  the  total  execution  time  in  the  DOCIP  to  complete  this  parallel  subtraction  is 

<(*)  <  20(*  -  1)  +  3  =  20*  -  17  =  0(k). 

4.3  Multiplication  of  Binary  stack-coded  Numbers 

Let  the  result  of  the  parallel  multiplication  be  X  x  R  =  M  =  (M(2*_i),M(2*_2),  ...,M(0))  (e.g.  Fig. 
12(e)).  Since  binary  multiplication  is  equivalent  to  the  addition  of  shifted  versions  of  the  multiplicand, 
applying  the  corresponding  parallel  operations,  we  can  implement  the  parallel  multiplication  by  the 
equations 

5*°)  =  (0,0,  ...,Q,  X(k-i)  n  5(o) » -^(fc-J)  n  R{0),  ~,*(0) n  -5(o))  (+4) 

k 

P®  -  (Q,0,  ...,0,  X(h-i)  H  ■£(.•), -X(*-3)  FI  5(,'),  ...,X(oj  n  5(3,0,0,...,0)  (45) 

k-i  i 

Jr— 1 

X  x  5  =  M  =  Y.  p{i)  ■  1*0)  +  ^(1)  +  -  +  pik~l)  (46) 

imQ 

where  *  =  0,1, *  - 1,  and  the  addition  +  is  defined  in  subsection  41.  Since  this  parallel  multiplication 
requires  at  most  *  -  1  additions,  each  addition  takes  O(k)  time  for  the  DOCIP,  and  each  can  be 
generated  in  O(k)  time,  the  total  execution  time  is  0(*3). 

5  Symbolic  Substitution  and  Binary  symbol-coded  Arithmetic 

Symbolic  substitution  was  first  considered  as  a  means  of  utilizing  the  parallelism  of  optics  by  Huang  [17], 
Recently,  the  use  of  symbolic  substitution  as  a  basis  for  digital  optical  computing  has  been  reported  in 
[17]-(19]  [24]-(31].  Special  symbolic  substitution  rules  can  be  applied  to  perform  arithmetic  operations 
and  simulate  a  Turing  machine  [19].  Although  symbolic  substitution  demonstrates  the  ability  to  solve 
any  computable  problem  and  performs  many  operations,  we  will  formalize  symbolic  substitution  by  BIA 


27 


algebraic  symbols,  demonstrate  that  symbolic  substitution  rules  are  particular  BLA.  image  transforma¬ 
tions,  and  give  the  BIA  formal  notations  of  binary  symbol-coded  (symbolic  substitution)  arithmetic. 
We  show  that  the  symbolic  substitution  implementation  of  some  operations  is  relatively  complicated  to 
other  implementations. 

5.1  BIA  Representation  of  Symbolic  Substitution 

In  this  subsection  we  give  the  BIA  equation  for  symbolic  substitution  and  show  how  it  can  be  implemented 
on  the  DOCIP  machine.  A  symbolic  substitution  rule  involves  two  steps:  1)  recognizing  the  locations 
of  a  certain  search-pattern  within  the  2-D  binary  input  data,  and  2)  substituting  a  replacement-pattern 
wherever  the  search-pattern  is  recognized.  We  derive  it  by  BIA  in  the  following  steps  (illustrated  in  Fig. 
13): 

1.  BIA  Notations  for  Symbolic  Substitution: 

•  2-D  binary  input  data  =  image  (bit  plane)  X 

•  Symbol  to  be  recognized  (search-pattern)  =  reference  image  (or  image  pairs)  R 

•  Symbol  to  be  replaced  (replacement-pattern)  =  reference  image  Q 

2.  A  Symbolic  Substitution  Rule: 

•  Step  1.  recognition  of  the  search-pattern: 

(a)  Foreground  recognizer  the  locations  of  a  certain  spatial  search-pattern  R\  (defined  by 
its  foreground)  within  the  foreground  of  the  2-D  input  data  X  can  be  recognized  by  the 
erosion  operation  of  X  and  R\. 

X  e  R\  =  1t®R\.  (47) 

(b)  Background  recognizer  the  locations  of  a  certain  spatial  search-pattern  R?  within  the 
background  of  the  2-D  input  data  X  can  be  recognized  by  the  erosion  of  7  and  R?  : 

7 6  Rj  =  X  ©  Ri.  (48) 


(c)  Fall  recognizer  by  combining  the  two  above  steps,  the  locations  of  a  certain  spatial 
search-pattern  R  =  (Ri,Ri)  (R\  defines  the  foreground,  and  Rj  defines  the  background) 
within  the  2-D  input  data  X  can  be  recognized  by  the  hit  or  miss  transform  of  X  and  R: 


x®R  =  (xe  Ri)  n  (Xe  fa)  =  tfeAi)  u(x$  r3).  (49) 

•  Step  2.  substitution  of  the  replacement-pattern: 

-  Substituter  a  new  replacement- pattern  Q  can  be  substituted  for  R  wherever  the  search- 
patters  R  is  recognized  by  the  dilation  of  X  ®  R  by  Q. 

•  Synthesis: 

-  A  complete  symbolic  substitution  rule  is  implemented  by  the  hit  or  miss  transform  of  X 
by  R  followed  by  the  dilation  by  Q: 

(X®  R)  ©  <3  =  ((X©  Rx)  n  (X©  *a))  ®  Q  =  (* ®  Rx)  U  (X®  R7)  ©  Q.  (50) 

•  Optional  masking: 

-  An  optional  mask  M  can  be  used  for  controlling  the  block  search  region.  A  symbolic 
substitution  rule  can  be  modified  as: 

\ 

{(X®R)nM)&Q.  (51) 

By  proper  choice  of  M,  the  search  can  be  made  in  overlapping,  disjoint  or  non-conti  guoys 
blocks. 

3.  A  symbolic  substitution  system  (Fig.  14): 

•  To  work  with  more  than  one  rule  (say  p  substitution  rules)  for  practical  applications,  a 
symbolic  substitution  processor  produces  several  copies  of  the  input  X,  provides  p  different 
recognizer- substituter  units,  and  then  combines  the  outputs  of  various  units  to  form  a  new 
output.  Thus,  a  symbolic  substitution  system  is  implemented  by 

UOTQjjWjegw  (52) 


where  and  Q^'\  i  -  l,2,...,p,  are  the  reference  image  pairs  and  replacement  patterns 
in  the  iih  symbolic  substitution  rule.  This,  then,  is  the  BIA  formula  for  general  symbolic 


substitution. 

Hence,  a  general  mathematical  formalism  of  symbolic  substitution  has  been  developed.  For  a  local 
search-pattern  and  replacement-pattern  (i.e.  Hi.Hj.Q  C  N array  or  Nkypercube),  the  ' OCIP -array  or 
DOCIP-hypercube  can  implement  a  symbolic  substitution  rule  in  four  (or  five  with  the  optional  mask) 
steps: 

Assume  start  with  X  in  M\. 

1.  M\  ©  Hr  —*  M2 

2.  Mi  ®  Hj  — •  M3 

3.  M2  U  M3  —  M3 

4.  —  Out(=(X@H)©$) 

Let  the  pixels  used  in  the  substitution  rule(s)  of  a  symbolic  substitution  processor  be  the  neighborhood, 
NUl  of  the  processor.  We  see  from  the  above  steps  that  the  DOCEP  can  simulate  the  symbolic  substitu¬ 
tion  processor  in  constant  time  if  the  two  machines  have  the  same  neighborhood.  If  N„  is  not  a  subset 
of  the  BOCIP  neighborhood,  then  the  simulation  will  take  longer.  In  either  case,  it  is  not  presently 
known  how  many  steps  it  takes  the  symbolic  substitution  processor  to  simulate  the  DOCIP. 

5.2  Binary  symbol-coded  (Symbolic  Substitution)  Arithmetic 

A  bit  in  a  binary  number  is  encoded  in  symbolically  as  pixels  of  an  image  (Fig.  15).  In  this  subsection, 
we  primarily  concentrate  on  simple  intensity  coding:  a  logic  value  (0  or  1)  is  represented  by  a  single 
pixel  (dark  or  bright)  (Fig.  15(a)),  as  in  the  binary  row  and  stack-coded  number  representations,  but 
the  operands  of  binary  numbers  x,  and  r,-  are  stored  in  the  same  input  image  X  as  shown  in  Fig.  16(a). 
The  expected  output  images  of  symbolic  substitution  for  binary  addition  and  binary  subtraction  are 
shown  in  Fig.  16(b)-(c).  To  achieve  these  desired  operations,  the  symbols  associated  with  the  operands 
are  recognized  and  then  replaced  by  new  symbols  associated  with  the  results  of  the  operation.  Systems 
for  implementing  binary  addition  and  subtraction  are  formalized  and  illustrated  as  examples  of  binary 
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symbol-coded  arithmetic  below. 


5.2.1  Addition  of  Binary  symbol-coded  Numbers 

This  parallel  binary  addition  (Fig.  17)  can  be  implemented  with  four  symbolic  substitution  rules  (Fig. 
17(a))  [17]  [18].  In  the  case  of  simple  intensity  coding,  as  we  will  show,  Rule  1  is  not  necessary.  The 
symbolic  substitution  system  for  simple  intensity  coding  can  be  realized  as 


y(to)  =  x 

(53) 

=  U((y(*i)® R(0) n  AO  ©  Qli) 

(54) 

where  y(t*+i)  is  the  result,  j  =  0, 1,2,  ...,k  +  1,  k  is  word  size  (i.e.  the  number  of  bits 

R^  —  (R^\R^^)  and  are  shown  in  Fig.  17(b)  and  represented  as 

in  a  operand); 

1.  =  *,  4X)  =  ULo-8’.  <3(1)  = 

2.  R[2)  =  /,  4J)  =  B,  QW  =  /, 

3.  R[3)  =  B,  43)  =  I,  QW  =  I, 

4.  =  ULo  B\  44)  »  g(4)  =  A~lB. 

Here  the  null  image  <j>  and  the  elementary  images  are  as  defined  in  subsection  2.1;  the  mask  M  (Fig. 

17(c)),  used  for  controlling  the  block  search  region,  is  the  image  corresponding  to  the  coordinates  of 

the  origins  (lower-lefter  pixels)  of  the  input  symbols  in  the  input  image  X.  An  example  is  given  in  Fig. 

17(d).  Note  that  QW  =  <b  implies 

((Y(tj)®RW)nM)eQM  =+, 

(55) 

so  that 

Y(t» i)  * 

*  ®  *(0)  n  M )  9  Q& 

*  UiattfU)  ®  ^t(0)  u  PM  9  A,10)  n  It)  9  Q®. 

(56) 
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Thus,  for  simple  intensity  coding  of  symbolic  substitution,  we  can  reduce  the  four  rules  of  binary  addition 
to  only  three  rules.  However,  this  reduction  of  complexity  cannot  be  applied  to  dual-rail  or  six-pixel 
coding. 

When  implemented  on  the  DOCIP,  this  addition  requires  at  most  Jfc  +  1  iterations,  each  iteration 
requiring  two  union  operations  of  three  results  of  symbolic  substitution  rules,  and  each  rule  is  realized 
within  live  steps  as  shown  in  subsection  5.1.  Thus,  the  total  execution  time  in  the  DOCIP  is 

t(k)  <(3x5  +  2)(*  +  1)  =  17(*  +  1)  =  O(k). 

When  using  2  or  6  pixels  to  represent  a  logic  value  (Fig.  15(b)-(c)),  we  can  formalize  symbolic  substitution 
addition  as 

•  Dual-rail  coding  (Fig.  15(b))  [19][20]:  we  can  implement  a  full  recognition  with  only  a  background 
recognizer  (or  foreground  recognizer) 

m+i)  =  ULi((y(*;)®*(<))nAf)©QM 

*  UtiCtfU)  ©  Rp])  u  (V(t,-)  ©  Ap1)  nM)e  Q(C>  (57) 

=  ULi(r(«i)  ©  Ri(<)  n  Af)  © 

where  j  =  0,1,2,...,*;  Q®  8X8  shown  in  Fig.  18(a)  and  represented  by 

elementary  images  as 

1.  R[l)  =  I U  fla,  4!,  =  SU B3,  g<l>  *  /U  A~lB3, 

2.  r[2)  =  ULX  B',  4J>  =  /U  B3,  QW  =  B  U  A~lB3, 

3.  43)  *  /U  S3,  R^  =  lg.i  B\  QW  =  B  U  A~lB3, 

4.  44)  *  B  U  B 3,  Rif'  =  IuB3,  QW  =/u  A~lB3; 

and  the  mask  M  is  shown  in  Fig  18(b).  Since 

u  (y(tj)  ©  ji,(i))  =  (F(<i)  ©  &i{{))  =  (y(tj)  ©  (53) 

for  the  dual-rail  coding,  can  be  represented  by  only  its  foreground  background  dp. 

For  implementation  on  the  DOCIP,  this  algorithm  requires  four  rules,  and  each  rule  involves  two 
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dilations  and  one  union  or  intersection.  Because  they  may  be  not  included  in  Narray  or  iV*ypereu4e, 
each  dilation  of  R^  or  is  implemented  by  2-4  steps  for  the  DOCIP-arrayS  and  1-2  steps  for 
the  DOCIP-hypercube8.  The  total  execution  time  is  bounded  by  28 (k  +  1)  for  the  DOCIP-arTay8 
and  18(fc  +•  1)  for  the  DOCIP-hypercubeS.  Moreover,  it  requires  more  difficult  dual-rail  coding  and 
doubles  the  device  area. 

•  Six-pixel  coding  (Fig.  15(c))  [30]:  the  mask  M  is  not  needed  and 

i'(^i)=U(y(ii)®A(,))®«(0  (59) 

ta»l 

where  j  =  0, 1, 2, ...,  k,  k  is  the  word  size;  =  (R[1\r!^)  and  are  shown  in  Fig.  21  and  are 
represented  as 

1.  r[1}  *  ruABlsB2lsAB3,  4X)  =  Blll)B3llAll  AB2u({jfm0  A2B'),  =  A~3B2uA~2B3u 

I U  AB, 

2.  S[2)  =  (U2-!  B*)uAuA3,  R^]  =  In  ^UdjLi  ABi)u{\Jm0A2Bi),  Q™  =  A~3B2uA~2B3\j 
Bis  A, 

3.  R[3)  =  /uB3U((JLi  AS’),  R^3)  *  (U^  S’)UAUA3U(U3B<JAJ5‘),  Q<3>  =  A~3S2UA~253U 
Bis  A, 

4.  R^  =  Bl)B3liAuAB2,Ri?)  a/uS,UAJuAB3U(UL)AJB*),Q(4)  =  A-3S3uA'jS2u 
/u  AB. 

The  six-pixel  coding  removes  the  need  for  the  mask  M ,  but  requires  more  difficult  encoding,  more 
difficult  implementation  of  the  hit  or  transform  by  R^  and  dilation  by  Q^,  and  six  times  the 
hardware  area.  Addition  on  the  DOdP-array  or  DOCIP-hypercube  using  six-pixel  coding  takes 
much  more  time  (on  the  order  of  ten  times)  than  simple  intensity  coding  or  dual-rail  coding. 

5.2.2  Subtraction  of  Binary  symbol-coded  Numbers 


Similar  to  addition,  we  generally  use  4  symbolic  substitution  rules  (Fig.  20(a)),  but  Rule  1  and  Rule  4 
are  not  necessary  for  simple  intensity  coding.  The  symbolic  substitution  system  using  simple  intensity 


coding  for  binary  subtraction  can  be  realized  as 


Y(to)  =  X  (60) 

=  ULi((Y(i:)G^)nAf)6<S^ 

=  U^i((F(t})  ©  5i(,))  u  (Y(tj)  e  R*(,))  n  M)  ©  <?(•■)  (6i) 

=  UL2((F(t,)  ®  R^)  u  (Yfa)  e  IF)  n  M)  ®  <?(*> 

where  y(i*+l)  is  the  result  of  the  subtraction,  j  =  0,1,2,...,*,  *  is  word  size  (i.e.  the  number  of  bits  in 

a  operand);  R «  =  (A^R*0)  and  <3<*>  are  shown  in  Fig.  20(b)  and  represented  as 

1.  r[1)  =  *  4J)  =  uLo^-,  g(1)  = 

2.  a|3)  =  B-\  42)  =  A  <3(3)  =  /u 

3.  a{3)  =  /,  43)  =  Q<3>  =  /, 

4.  =  g(4)  =  * 

where  the  null  image  4  and  the  elementary  images  are  as  defined  in  subsection  2.1;  and  the  mask  M 
(Fig.  20(c))  is  a  shifting  of  the  mask  for  binary  addition.  Because  and  are  null  images,  and  the 
dilation  of  a  null  image  is  a  null  image.  Rule  1  and  Rule  4  are  not  needed  for  simple  intensity  coding. 
Fig.  20(d)  gives  an  example.  The  execution  time  for  the  DOCIP  is 

*(*)<  U(Jfe+l)*0(Jb). 

Similar  to  binary  addition,  we  can  develop  symbolic  substitution  binary  subtraction  algorithms  with 
BIA  representations  for  coding  a  symbol  with  two  or  six  pixels.  However,  four  symbolic  substitution 
rules  are  still  required  because  and  will  not  be  equal  to  the  null  image.  The  DOCIPs  take 
approximately  the  same  execution  time  for  binary  subtraction  using  dual-rail  or  six-pixel  coding  as  for 
binary  addition. 

6  Complexity  of  Parallel  Optical  Binary  Arithmetic 

We  have  shown  that  BIA  offers  a  general  tool  for  mapping  serial  binary  arithmetic  into  different  forms 
of  parallel  binary  arithmetic  (including  binary  row(or  column)-coding,  binary  stack-coding,  and  three 
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coding  techniques  for  symbolic  substitution  arithmetic)  in  a  precise  and  compact  way.  The  complexity 
of  parallel  addition  and  subtraction  of  two  N  x  N  arrays  of  binary  numbers  (each  number  with  k- bit 
length)  for  these  different  number  representations  are  compared  in  Table  1  and  Table  2.  Binary  row(or 
column)-coded  arithmetic  requires  the  smallest  number  0  of  fundamental  operations.  Binary  stack- 
coded  arithmetic  requires  the  lowest  number  of  processing  elements  (or  cells)  P  and  the  smallest  overall 
0  x  P  complexity  (assume  each  parallel  fundamental  operation  corresponds  to  P  processing  elements 
executing  in  parallel).  For  the  normal  case  in  which  the  word  size  is  larger  than  one  and  much  smaller 
than  the  image  size  (1  <  k  <  jV),  binary  row(or  column)-coded  arithmetic  can  be  implemented  in  the 
DOCIP  with  the  fastest  computation  speed  (assume  the  DOCIP  can  input  all  operands  In  an  image 
at  a  time).  The  complexity  of  binary  symbol-coded  (symbolic  substitution)  arithmetic  in  general  is 
in  all  cases  higher  than  that  of  binary  row(or  column  ;-coded  and  binary  stack-coded  arithmetic.  For 
implementing  symbolic  substitution  algorithms  on  the  DOCIPs,  the  simple  intensity  coding  is  superior 
to  the  other  symbol  coding  techniques. 

7  Conclusion 

Binary  image  algebra  (BIA)  is  demonstrated  to  be  a  general  technique  for  developing  and  formulating 
parallel  numerical  and  non- numerical  computation  algorithms  for  digital  optical  computers.  The  DOCIP 
is  a  simple  optical  architecture  for  effectively  implementing  BIA.  Symbolic  substitution  is  a  subset  of 
BIA  and  can  be  formalized  in  compact  BIA  expressions.  Three  different  techniques  for  parallel  optical 
binary  arithmetic,  based  on  binary  row(or  column-coding,  binary  stack-coding,  and  binary  symbol¬ 
coding  (symbolic  substitution) ,  are  illustrated  for  implementation  on  the  DOCIP.  Binary  row-coding 
arithmetic  has  fast  DOCIP  execution  and  binary  stack-coding  arithmetic  requires  the  lowest  number  of 
computations  0  x  P.  In  summary,  BIA  and  the  DOCIP  represent  \  simple  yet  powerful  parallel  digital 
optical  algorithmic  and  architectural  technique  for  both  numerical  and  non- numerical  applications. 
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Figure  1:  An  example  of  fundamental  operations:  complement  union  U,  and  dilation  ©. 


Figure  2(a):  Difference. 


Figure  2(b):  Intersection. 
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R 

Figure  2(d):  Symmetric  difference. 
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M 


Figure  2(e):  Hit  or  miss  transform  (template  matching). 


Figure  2:  Some  standard  derived  image  operations.  The  shaded  regions  in  (a)-(d)  correspond 
to  pixels  with  value  1.  r 
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Figure  3:  A  digital  optical  cellular  image  processor  (DOCIP)  architecture  —  one  implementa¬ 
tion  of  binary  image  algebra  (BIA).  The  DOCIP-array  requires  9  (or  5)  control  bits  for  reference 
image  E,.  The  DOCIP -hyper cube  requires  O(logN)  control  bits  for  reference  image  E,. 
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Figure  4:  An  optical  4-directed  or  8-directed  cellular  hypercube  (DOCIP-hypercube4  or  DOCIP - 
hypercubeS).  Each  ceil  connects  with  cells  in  the  4  directions  or  3  directions  at  distances 
1,2, 4,3, ...,  2*  from  itself  by  optical  3-D  interconnections. 
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Figure  5:  Binary  row-coded  numbers. 
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(a):  An  image  X  of  operands.  (b):  An  image  R  of  other  operands. 


(c):  The  outpur  X  -h  R. 
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(d):  The  procedure  for  parallel  addition  X  +  R. 


Figure  6:  Parallel  addition  of  binary  row-coded  numbers. 
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(a):  An  image  X  of  operands. 


(b):  An  image  R  of  other  operands.  (c):  The  output 
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(d):  The  procedure  for  parallel  subtraction  X  -  R. 

Figure  7:  Parallel  subtraction  of  binary  row-coded  numbers. 
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Figure  8.  Parallel  (matrix-constant)  multiplication  of  binary  row-coded  numbers. 
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(b):  An  image  R  of  other  operands.  (c):  Tie  output  X  x  R. 
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Figure  9:  Parallel  (element-element)  multiplication  of  binary  row-coded  numbers. 
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Figure  10:  Binary  column-coded  numbers. 


Figure  11:  Binary  stack-coded  numbers.  *«(m)  represents  the  mtf>  bit  of  the  «,A  number  in  the 
image  plane.  X (0)  represents  the  image  plane  of  least  significant  bits  and  X(*_i)  represents  the 
image  plane  of  most  significant  bits. 
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Figure  12(a):  A  sequence  of  images 
X  =  (^(3).  Jf(2),  AT(1),  Jf(0)). 


Figure  12(c):  The  sum  X+R  = 

(•^(4)*  S(Z),  5(2),  5(j),  5(0, )• 


Figure  12(b):  A  sequence  of  images 
R  ~  (R(3),R(2),R(x),R{q)). 


Figure  12(d):  The  difference  D  =  }, 
(Z>(3),  Z>(3),  0(1),  X>(0)). 


Figure  12(e):  The  product  M  »  X  x  R  = 

(M(7),  M(6),...,A/(0)). 


Figure  12:  Parallel  arithmetic  with  binary  stack-coded  numbers. 
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Figure  13:  BIA  representation  of  symbolic  substitution.  The  optional  mask  M  is  tor  controlling 
the  block  a  each  region. 
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Figure  14:  A  symbolic  substitution  system  with  p  symbolic  substitution  rules. 
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Figure  15(a):  The  simple  intensity  coding  of 
zero  and  one  (a  bit  is  a  pixel). 
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Figure  15(b):  The  dual-rail  coding  of  zero  and 

Figure  15(c):  The  six-pixel  coding  of  zero  and 

one  (a  bit  is  encoded 
from  [18][19]). 

as  two  pixels)  (adapted 

one  (a  bit  with  value  zero  or  one  is  encoded  as 
six  pixels)  (adapted  from  [30]). 

Figure  15:  A  bit  encoded  as  a  symbol, 
k-bit 


Figure  16(a):  The  input  image  X  contains  the 
operands  x,-  and  the  other  operands  r,-. 
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Figure  16(c):  The  output  of  parallel  subtrac- 
Figure  16(b):  The  output  of  parallel  addition.  tion. 

Figure  16:  Binary  symbol-coding  (symbolic  substitution)  binary  arithmetic). 
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Figure  17(a):  Four  symbolic  substitution  rules  for  addition. 
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a(3)  =  B,  Rp  =  /  <?(3,  =  /  *<4,  =  uU*\44)  =  *  QW  =  A'1B 

uiT«K<!’A  P,“"  “dJe?'“Ce  *“«-  •  =  1,2, 3, 4,  for  add,. 

V  is  a  null  image.  Rule  1  is  not  needed  for  this  simple  intensity  coding. 


Figure  17(c):  The  mask  Af. 


k-5  bits 
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10100 

-10011 

Rule  i. 

Difference  bits 

00111 

Rule  2. 

Borrow  bits 

.floau- 

Difference  bits 

00001 

Rule  3. 

Borrow  bits 

0011 

Difference  bits 

ooodi 

Rule  4. 

Borrow  bits 

000 

0 


0 


1 

1 


1 

0 


0 

0 


Figure  20(a):  Four  symbolic  substitution  rules  for  subtraction. 


R{1}  =  R[l)  =  UU  B" 


Origin 


g(U  =  o 


=  I  Q<2»  =  /uA-1S~1 


Rule  3. 


R™  =  /,  R(?]  =  B-1 


Q(3)  =  / 


Rule  4. 


=  U.U  B~\  Rl*]  =  * 


Figure  20(b):  Reference  image  pairs  R[>)  and  reference  images  Q{'\  i  =  1,2.3, 4,  used  for  sub¬ 
traction.  Because  Q(l)  and  are  nuU  images,  Rules  1  and  4  are  not  needed  for  simple  it- tensity 

coding. 


k-5  bits 


Figure  20(d):  An  example  of  parallel  subtraction  of  binary  symbol-coded  numbers. 


Figure  20:  Parallel  subtraction  of  binary  symbol-coded  numbers. 
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Number 

Representation 

Binary 

Row-coding 

Binary 

Stack -coding 

Symbolic 
Substitution 
(simple  intensity 
coding) 

Symbolic 
Substitution 
(dual-rail  coding) 

No.  of  Dilations 
(or  Erosions) 

k 

0 

9(k+1) 

8(k+1) 

No.  of  Unions 
(or  Intersections) 

4k +3 

16k-12 

15(k+1) 

16(k+1) 

No.  of 

Complements 

7k +4 
*2k+2 

20k- 13 
*7k-5 

12(k+1) 
*3(K+1 ) 

16(k+1) 

'4(k+1 ) 

Total  No.  of  Parallel 
Fundamental  O 
Operations 

12k+7 

*7k+5 

36k-25 
•23k- 17 

36(k+1) 

*27(k+1) 

40(k+1) 

*28(k-t-1) 

No.  o! 

Processing  P 

Elements 

kN2 

N2 

2kN2 

4k  N2 

Total  No.  of 
Computations  ®xP 

(12k+7)kN* 

*(7k+5)kN* 

(36k-25)N2 

*(22k-16)N* 

76k(k+1)hP 
*54k(k+1  )N* 

IBOkfk+IJN2 

*112k(k-^1)M» 

DOCIP  T 

Execution  Time 

5k +3 

17k-13 

1 7(k+1 ) 

18(k+1)  or 
28(k+1) 

PxT 

(5k+3)kN* 

(17k-13JN* 

34k(k+1)N* 

72k(k+1)N2  or 

1 12k(k+1  (N2 

'  indicates  the  number  of  operations  when  erosion  and  intersection  are  also  allowed. 


Table  1 .  Complexity  of  parallel  optical  binary  addition  of  two  NxN  arrays  of  k-bit  binary  numbers. 
Each  parallel  fundamental  operation  corresponds  to  P  processing  elements  executing  in  parallel. 


Number 

Representation 

Binary 

Row-coding 

Binary 

Stack-coding 

Symbolic 
Substitution 
(simple  intensity 
coding) 

Symbolic 
Substitution 
(dual-rail  coding) 

No.  of  Dilations 
(or  Erosions) 

k 

0 

6(k+1) 

8(k+1) 

No.  of  Unions 
(or  Intersections) 

4k +3 

16k- 12 

10(k+1) 

16(k+1) 

No.  of 

Complements 

6k+4 

*3k+2 

22k- 18 
*1  Ik-8 

8(kV.) 

2(K+1) 

Total  No.  of  Parallel 
Fundamental  0 
Operations 

11k+7 

*8k+5 

43k-33 

*33k-26 

24(k+1 ) 
*18(k-t-1) 

40(k+1) 

*28(k+1) 

No.  of 

Processing  P 

Elements 

kN* 

N* 

2  k  N* 

4kN* 

Total  No.  of 
Computations 

(11k+7)kN* 

*(8k+5)kN* 

(43k-33)N* 

•(33k-28)N» 

48k(k+1)N* 

*36k(k+1)N* 

160k(k+1)N* 

*112k(k+1)N* 

DOCIP  T 

Execution  Time 

4k +3 

20k- 17 

11(k+1) 

1 8(k+1 )  or 
28(k+1) 

PxT 

(4k+3)k  N* 

(20k- 17)1* 

22k(k+ 1  )N* 

72k(k+1)N*  or 
112k(k+1)N* 

'  indicates  the  number  of  operations  when  erosion  and  intersection  are  also  allowed. 


Table  2.  Complexity  of  parallel  optical  binary  subtraction  of  two  NxN  arrays  of  k-bit  binary  numbers. 
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2.1  Binary  Image  Algebra 


Binary  image  algebra  (BIA)  forms  the  mathematical  background  for  software  and  hardware 
systems  suitable  for  optical  digital  computing.  Parallel  algorithms  for  optical  cellular  logic  and 
symbolic  substitution  processors  can  be  formalized  as  compact  BIA  expressions.  BIA  also  leads  to 
the  architectural  design  of  digital  optical  cellular  image  processors  (DOCIP)  which  are  well-suited 
to  executing  the  parallel  algorithms.  The  following  paper  “An  Image  Algebra  Representation  of 
Parallel  Optical  Binary  Arithmetic”  submitted  to  Applied  Optics  summarizes  our  recent  work  in 
this  area. 
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Abstract 


Is  there  a  simple  unified  consistent  complete  theory  of  parallel  binary  digital  image  processing 
algorithms  and  architectures?  Can  this  theory  suggest  new  parallel  architectures  for  image 
processing?  Can  these  architectures  be  implemented  by  optical  computing  techniques?  We 
attempt  to  answer  these  questions. 

Techniques  for  digital  optical  cellular  image  processing  are  presented.  A  binary  image  algebra 
(BlA),  built  from  only  five  elementary  images  and  three  fundamental  operations,  serves  as  its 
software  and  leads  to  a  formal  parallel  language  approach  to  the  design  of  parallel  binary  image 
processing  algorithms.  Its  applications  and  relationships  with  other  computing  theories  demon¬ 
strate  that  BlA  is  a  powerful  systematic  tool  for  formalizing  and  analyzing  parallel  algorithms. 
Digital  optical  cellular  image  processors  (DOCIPs),  based  on  cellular  automata  and  cellular 
Logic  architectures,  serve  as  its  hardware  and  implement  the  parallel  binary  image  processing 
tasks  naturally  and  effectively.  An  algebraic  structure  provides  a  link  between  the  algorithms 
of  BIA  and  architectures  of  DOCIP.  Optical  computing  suggests  a  more  ideal  and  efficient  im¬ 
plementation  of  the  DOCEP  architectures  because  of  its  inherent  parallelism  and  3-D  global  free 
interconnection  capabilities.  Finally,  the  instruction  set  and  the  programming  of  the  DOCIPs 
are  illustrated. 
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1  Introduction 

In  this  paper  we  combine  stndies  of  architectures,  algorithms,  mathematical  structures,  and  op¬ 
tics  to  show  that:  1)  an  image  algebra  extending  from  mathematical  morphology  [2]-{5]  can  lead 
to  a  formal  parallel  language  approach  to  the  design  of  image  processing  algorithms;  2)  cellular 
automata  are  appropriate  models  for  parallel  image  processing  machines  [6][7];  3)  an  algebraic 
structure  serves  as  a  framework  for  both  algorithms  and  architectures  of  parallel  image  process¬ 
ing;  and  4)  the  parallel  processing  and  global  interconnection  advantages  of  optical  computing 
may  be  useful  in  efficiently  implementing  image  algebra  with  cellular  logic  architectures. 

The  purpose  of  the  image  algebra  approach  in  this  paper  is  for  the  development  of  a  pro¬ 
gramming  language  for  a  specific  parallel  architecture,  namely  a  digital  optical  cellular  image 
processor  (DOCIP).  The  binary  image  algebra  (BIA)  described  here  is  based  on  a  set  of  three 
specific  fundamental  operations.  These  fundamental  operations  are  the  key  operations  in  the  in¬ 
struction  set  of  the  DOCIP  machine.  The  BIA  provides  a  decomposition  of  general  operations, 
including  low-level  image  processing  operations,  into  the  three  fundamental  operations  of  the 
instruction  set.  This  decomposition  is  inherently  parallel  and  provides  a  direct  mapping  to  the 
machine  architecture. 

In  this  section,  we  first  review  previous  work  on  image  algebra,  cellular  automata  and  cellular 
logic  architectures,  then  define  the  algebraic  structure  and  outline  the  detailed  discussion  that 
follows. 

Previous  Work  on  Image  Algebra 

During  the  past  few  years,  numerous  papers  have  used  an  algebraic  approach  to  aid  in 
image  processing  [2]-[5]  [8]-(l0].  Among  them,  morphological  image  algebra  has  the  closest 
relation  to  binary  image  algebra  (BIA).  Many  papers  describe  either  specific  theoretical  aspects 
of  mathematical  morphology  or  application-specific  morphological  algorithms  [11]-[18].  The 
applications  of  mathematical  morphology  has  been  fruitful.  In  this  paper  we  adapt  it  to  provide 
the  following  features: 

1.  A  simplified  mathematical  structure.  Mathematical  morphology  comprises  two  branches, 
integral  geometry  and  geometrical  probability,  plus  a  few  collateral  ancestors  (harmonic 
analysis,  stochastic  processes,  algebraic  topology)  [2].  The  mathematical  details  and  formal 
proofa  in  morphology  are  often  intricate  and  involve  advanced  set  theoretic  and  topological 

,  concepts  which  are  not  always  necessary  for  engineering  applications. 

2.  A  complete  algebraic  theory.  Mathematical  morphology  defines  some  algebraic  operators 
and  utilizes  some  algebra.  With  our  adaptation,  we  would  like  to  answer  the  following 
questions: 

•  What  is  the  algebraic  definition  of  this  mathematical  morphology? 

•  How  powerful  is  this  mathematical  morphology? 

•  What  is  the  definition  of  a  transformation?  Morphological  transformations  are  con¬ 
strained  by  four  principles  [2],  here  we  introduce  a  complete  definition  of  image  trans¬ 
formations. 

3.  Clarification  of  its  relationship  to  other  areas.  We  define  its  relationship  to  linear  sys¬ 
tem  theory,  image  processing,  and  common  computing  techniques  including  boolean  logic, 
cellular  logic,  and  algebraic  structures. 
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Here  we  develop  a  simple  unified  complete  parallel  binary  image  processing  theory  based  on 
an  algebraic  structure  —  binary  image  algebra  (BLA).  In  BLA,  parallel  binary  image  process¬ 
ing  algorithms  (including  parallel  numerical  computations)  can  be  written  as  compact  algebraic 
expressions  where  an  algebraic  symbol  represents  an  image  (not  a  pixel)  or  an  image  operation 
(not  a  pixel-wise  operation).  A  complete  algebraic  system  comprises  three  fundamental  oper¬ 
ations  and  five  elementary  images  which  can  be  combined  to  generate  any  image  in  the  three 
fundamental  operations  tor  forming  any  image  transformation.  (In  fact,  one  can  define  four 
elementary  images  and  two  fundamental  operations  that  are  sufficient:  however,  in  this  paper 
we  will  not  consider  them  since  they  are  more  difficult  to  use.) 

There  are  other  image  algebras,  each  with  its  own  characteristics  [8]  [9].  Because  of  our 
intended  application  to  a  highly  parallel  computing  machine  with  simple  processing  elements 
and  a  reduced  instruction  set,  we  utilize  a  BIA  with  only  three  fundamental  operations  that 
can  implement  any  binary  image  transformation.  For  example,  the  counting  function,  which 
gives  the  number  of  pixels  having  a  certain  level,  is  considered  a  mapping  from  a  picture  type  of 
operand  to  another  number  type  of  operand  in  references  [8]  and  [9];  in  BLA  numbers  are  also 
represented  as  images  [19].  BLA  suggests  several  simple  but  fast  parallel  image  algorithms  and 
a  parallel  image  processing  architecture  with  a  very  low  cell  complexity. 

Previous  Work  on  Cellular  Logic  Architectures 

To  match  BIA  parallel  algorithms  by  cellular  logic  architectures  in  a  transparent  way,  we  char¬ 
acterize  a  cellular  automaton  by  ah  algebraic  structure  as  BIA  does.  The  cellular  logic  computer 
was  first  inspired  by  the  writings  of  von  Neumann  [20][2l]  on  cellular  automata.  A  sequential 
process  of  cellular  logic  operations  is  described  in  Fig.  1.  Some  review  of  cellular  image  proces¬ 
sors  can  be  found  in  Ref.  [21]-[25].  Many  cellular  computers  have  been  constructed  previously 
for  implementing  cellular  logic  operations,  and  some  ideas  for  extending  the  nearest-neighbor 
connected  cellular  logic  computers  for  improving  speed  and  flexibility  have  been  proposed  [24]. 
These  architectures  include:  (1)  the  cellular  string  (Fig.  2(a));  (2)  the  cellular  array  (Fig.  2(b)); 
and  '3)  the  cellular  hypercube  (Fig.  2(c))  and  the  cellular  pyramid  (Fig.  2(d)).  These  three 
architectures  share  a  common  feature  in  the  simplicity  and  regularity  of  interconnecting  simple 
processing  elements,  and  represent  an  interconnection  topology  in  1-D,  2-D,  and  3-D,  respec¬ 
tively.  The  3-D  case  is  difficult  to  implement  on  a  planar  VLSI  chip  [24]  [26]  [27],  but  may 
be  realizable  by  a  digital  optical  system  [28][29].  Two  promising  architectures  based  on  digital 
optical  cellular  image  processors  (DOCIPs),  DOdP-anay  and  DOCIP-hypercube,  are  presented 
below  as  a  means  of  implementing  BIA  effectively. 

Definition  of  Algebraic  Structure 

An  algebraic  structure  (or  algebra)  [30]-[32]  is  a  pair  (or  system)  A  =  (S,F)  where 

•  5  is  a  set,  and 

•  F  is  a  family  of  operations  which  are  functions: 

/  :  S*  -  5, 


and  k  is  a  finite  non-negative  integer. 
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Remark:  For  any  finite  aon-negacive  integer  k,  we  define  a  k- ary  operation  on  5  as  an  operation 
which  is  a  function  /  :  5*  —  5.  Thns,  a  unary  (or  1-ary)  operation  on  S  is  simply  a  function 
on  5  to  5.  A  binary  (or  2-ary)  operation  on  S  is  a  function  on  S2  to  S.  For  completeness,  we 
define  a  nullary  (or  0-ary)  operation  on  5  to  be  a  particular  element  of  5. 

Therefore,  the  problem  to  be  solved  is  essentially  to  find  a  “good”  algebraic  structure  (5,  F) 
for  parallel  binary  image  processing,  i.e.  to  search  for  5  and  F,  and  its  “good”  hardware 
implementation. 

Outline 

Section  2  contains  the  framework  of  BLA:  subsection  2.1  gives  the  basic  definitions;  subsection 
2.2  presents  two  fundamental  principles  which  prove  the  completeness  of  BLA. 

Section  3  describes  some  applications  of  BLA:  subsection  3.1  reviews  basic  properties  of 
images  and  image  transformations,  and  derives  from  them  some  standard  image  operations; 
subsection  3.2  gives  some  useful  theorems  for  properties  of  image  operations,  for  applications  in 
morphological  filtering,  shape  recognition,  “salt”  and  “pepper”  noise  removal,  size  and  location 
verification. 

Section  4  discusses  the  relationship  of  BLA  and  other  computing  theories:  subsection  4.1 
describes  the  relationship  with  boolean  logic;  subsection  4.2  describes  the  relationship  with 
symbolic  substitution  and  cellular  logic;  subsection  4.3  describes  the  relationship  with  linear 
shift  invariant  system  theory,  convolution  and  correlation;  subsection  4.4  descibes  some  standard 
algebraic  structures  supported  by  BLA. 

Section  5  contains  the  implementation  of  BlA  on  digital  optical  cellular  image  processors 
(DOCIPs):  subsection  5.1  gives  the  algebraic  description  of  the  DOCIPs  which  have  the  same 
algebraic  structure  as  BLA;  subsection  5.2  gives  the  general  description  of  the  DOCIPs. 

Finally,  the  programming  of  the  DOCIPs  is  illustrated  in  Section  6. 


2  Binary  Image  Algebra  (BIA)  Fundamentals 

The  overall  philosophy  of  BIA  is: 

•  An  image,  but  not  a  pixel,  is  an  object.  For  parallel  languages  and  machines  for  image 
processing,  images  can  be  considered  as  primitive  variables  for  simplifying  the  design. 

•  Complex  image  processing  operations  can  be  reduced  to  simple  instructions.  Although 
image  processing  operations  appear  complex,  the  fundamental  interactions  and  the  ele¬ 
mentary  components  in  a  system  are  very  simple. 

Thus,  BIA  begins  by: 

1.  Defining  the  universal  image  as  the  working  space  for  images  and  their  image  transforma¬ 
tions. 

2.  Defining  elementary  images  which  can  be  combined  to  generate  any  image. 

3.  Defining  fundamental  operations  which  can  be  cascaded  to  form  complex  operations. 
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4.  Defining  image  processing/ analysis  algorithm  design  as  the  choice  of  •‘good"  (or  ■‘appro¬ 
priate")  reference  images  and  transformations. 

A  reference  image  can  be  any  image  and  is  a  generalization  of  structuring  elements  in  mathemat¬ 
ical  morphology  [2].  Reference  images  contain  some  predefined  image  property  (or  information); 
image  transformations  (or  operations)  are  used  for  measuring  the  image  property  from  an  input 
image.  Image  description,  image  information  extraction  or  image  property  measurement  is  done 
by  using  reference  images  to  model  or  transform  the  original  image  to  a  final  state  which  reveals 
the  desired  information  or  is  used  to  detect  the  desired  properties  easily. 

Here  we  give  the  algebraic  structure  of  BLA  first,  an  then  provide  definitions  and  present 
two  fundamental  principles  which  allow  us  generate  any  reference  image  and  implement  any 
image  transformation.  Ideally,  BIA  may  be  further  generalized  to  GLA  (General  Image  .Algebra) 
which  deals  with  grey-level  and  complex-valued  images. 

2.1  Definitions 

Definition  of  Binary  Image  Algebra  (BLA.) 

Binary  image  algebra  is  an  algebra  with  an  image  space  S,  which  is  the  power 
set  of  a  predefined  universal  image  P(W),  and  a  family  F  of  operations  including  3 
fundamental  operations  (®,U,*),  which  are  non  0-ary  operations,  and  5  elementary 
images  (/,  A,  A~1,B,B~1),  which  are  0-ary  operations.  Symbolically, 

BIA  =  (P(W);®,ur,I,A,A-l,B,B-1) 

i.e.  S  =  P(W )  and  F  =  (®,U,~,/, A, A~l ,B,B~l).  The  image  space  5  and  the 
family  F  of  operations  will  be  derived  in  the  following. 


Basic  Definitions 

In  general,  a  binary  digital  image  is  defined  as  a  function  /  that  maps  each  grid  point  (x,  y) 
of  the  picture  on  an  orthogonal  coordinate  system  onto  the  set  composed  of  two  elements:  1  (i.e. 
white,  foreground  point  or  image  point)  and  0  (i.e  black  or  background  point).  However,  it  will 
be  more  convenient  for  our  algebra,  if  we  use  a  set  of  the  coordinates  of  image  points  (‘l’s)  to 
specify  an  image.  In  this  paper,  an  image  is  treated  as  the  set  of  coordinates  of  image  points 
(i.e.  foreground  points  or  pixels  that  have  value  1).  We  begin  the  description  of  BIA  by  defining 
our  artificial  universe: 

Definition  t.l  The  Universal  Image. 

The  universal  image  is  the  set  W  =  {(*,y)  j  z  6  Zn,y  €  Zn},  where  Z„  =  {0,±l,±2,...,±n} 
and  n  is  a  positive  integer  (Fig.  3). 

Remark:  “S”  means  “belongs  to”.  Notice  that  given  n,  the  universal  image  defines  the  domain 
of  our  images.  In  fact,  for  an  image  with  size  larger  than  (2n  f  l)x  (2n  +  1)  (the  size  of  the 
universal  image),  we  need  to  increase  the  size  of  the  universal  image  or  decompose  the  tested 
image  into  subimages  whose  sizes  are  smaller  than  the  size  of  the  universal  image.  For  the 
reason  of  simple  practice,  we  only  consider  the  square  tessellation  of  images.  To  deal  with 
non-square  tesselations  (such  as  the  hexagonal  digitization),  we  can  simply  replace  the 
universal  image  to  be  the  set  of  grid  points  corresponding  to  the  new  digitization  pattern. 
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Definition  2.2  Image  Space. 

The  image  space  is  the  power  set  (the  set  of  ail  subsets)  of  the  universal  image,  i.e.  5  =  P(W\ 
Definition  2.3  Image. 

A  set  X  is  an  image  if  and  only  if  X  is  an  element  of  the  image  space  5,  i.e.  X  is  a  subimage 
(subset)  of  the  universal  image  W.  Symbolically, 

X  is  an  image  —  X  6  S  <-*  X  C  W. 

Remark:  “C”  means  *is  included  in”.  There  exist  2(2n+1'*(2n+1>  different  images.  Three  terms 
related  to  images  are  defined: 

1.  Size  (or  area)  of  an  image  X,  denoted  as  #(X),  is  the  cardinality  (i.e.  the  number  of 
elements)  of  the  image  X. 

2.  Foreground  of  an  image  X,  simply  denoted  as  X ,  is  referred  to  those  pixels  with  value  1. 

3.  Background  of  an  image  X,  denoted  as  the  complement  X  (Definition  2.6),  is  referred  to 
chose  pixels  with  value  0. 

Once  we  know  the  foreground  of  an  image,  the  background  of  this  image  is  well  defined  (since 
the  universal  image  is  given  first).  Thus,  the  foreground  is  sufficient  to  specify  an  image. 

Definition  2.4  Image  Point  ( Foreground  Point). 

A  point  (x,  y)  is  an  image  point  of  an  image  X  if  and  only  if  (x,  y)  is  an  element  of  the  set  X. 
Remark:  The  largest  image  is  the  universal  image  W  and  consists  of  (2n  +  1)  x  (2n  +  1)  image 
points,  i.e.  #(W)  =  (2n  +  1)  x  (2n  4-  1);  the  smallest  image  is  the  null  image  f>  (defined  as  the 
complement  <s>  =  W)  and  has  no  image  points,  i.e  ${<)>)  -  0. 

Definition  2.5  Image  Transformation. 

A  transformation  T  is  an  image  transformation  if  and  only  if  T  is  a  function  mapping  from  the 
image  space  5  to  the  image  space  5. 

Remark:  There  exist  (2(Jn+1)x<5n+l))(2<Jn+1>>‘(3"+l))  image  transformations. 

Definition  2.6  Three  Fundamental  Operations. 

There  are  three  fundamental  operations: 

1.  Complement  of  an  image  X  (Fig.  4(a)): 

X *{(*,»)  |  (*,j)€WA(*,y)*  A'} 

2.  Union  of  two  images  X  and  R  (Fig.  4(b)): 

X  u  R  =  {(x,y)  |  (z,  y)  6  X  v  (x,y)  €  R} 

3.  Dilation  of  two  images  X  and  R  (Fig.  4(c)): 


{(*i  +  *a,yi  +  ya)  €  W  |  (si,yi)  6  X,(x2,yj)  €  R)  (X  #  <t>)  A  (R  £  f>) 

<p  otherwise 


Remark:  “a”  means  '‘and”,  and  “v”  means  “or”.  Note  that  X  usually  represents  an  input  or 
data  image  and  R  is  a  reference  image.  The  consideration  of  null  image  in  the  dilation 
operation  is  missing  in  mathematical  morphology  (where  the  dilation  is  defined  the  union  of  all 
translations  of  X  by  all  image  points  in  R );  with  this  generalization  we  have  a  complete  theory 
which  is  not  found  in  other  image  algebras  because  of  less  demonstration  of  their  capabilities 
for  implementing  any  image  transformation.  We  can  also  define  other  image  operations  as 
fundamental  operations  instead  of  these  three  operations.  The  reason  for  choosing  these  three 
operations  is  because  of  their  simplicity,  simple  software  design  and  simple  hardware 
implementation.  As  shown  later,  these  three  operations  may  be  implemented  by  a  2-D  optical 
gate  array  with  3-D  interconnections. 

Definition  2.7  Elementary  Images. 

These  elementary  images  are  constant  images,  i.e.  0-ary  operations.  Each  elementary  image 
has  only  one  image  point.  There  are  5  elementary  images: 

1 .  I  -  {(0,0)}  —  consisting  of  an  image  point  at  the  origin 

2.  .4.  =  {(1,0)}  —  consisting  of  an  image  point  right  of  the  origin 

3.  A-1  =  {(-1,0)}  —  consisting  of  an  image  point  left  of  the  origin 

4.  B  =  {(0, 1)}  —  consisting  of  an  image  point  above  the  origin 

5.  B~l  =  {(0,-1)}  consisting  of  an  image  point  below  the  origin 

Remark:  In  fact,  these  5  elementary  images  could  be  reduced  to  4  elementary  images,  because 
/  =  A0  =  A  0  A”1  =  5°  s  5  ©  B~l. 

Definition  2.3  Reflected  Reference  Image. 

Given  a  reference  image  R  which  is  a  predefined  image  for  containing  some  desired  image 
property  or  image  information,  its  refected  image  is  defined  as 

£  =  {(-*»-y)  l(*,jr)€  5}- 

Remark:  In  many  useful  cases  the  reference  image  R  is  symmetric,  then  R  =  R. 

2.2  Two  fundamental  Principles 

Two  fundamental  principles  basically  define  the  binary  image  algebra  (BIA).  Before  stating  these 
two  principles,  we  give  some  preliminary  results. 

Lemma  2.1. 

(X  ©  X)  u  (X  ©  ft)  u  7  =  {  '  v.r.ft  €  P[W) 

where  I  ~  {(0,0)}  is  an  elementary  image,  R  is  the  reflected  reference  image  of  R,  and  “V” 
means  “for  all”. 

Proof.  Appendix  A. 

Remark :  This  lemma  says  that  if  the  image  X  matches  the  image  R,  then  the  origin  (central 
pixel)  of  the  above  output  image  has  value  ‘1’,  otherwise  always  ‘O’. 
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Theorem  2.1. 


Any  image  transformation  T  :  P(W)  —  P(W)  can  be  expressed  as 

k 

nx)  =  |j{(y®iii)u(jr®  A)u7®<?,} 

wl 

where  k  <  #(P(W)),  1?^  and  Q;  are  the  reference  images  used  to  form  any  desir”-'  image 
transformation,  and 

k 

(J  R,  =  £1  u  R2  u  ...  u  Rk-  ■ 

i=i 

Proof.  Appendix  B. 

Theorem  2.2. 

.Any  image  can  be  represented  as 

X  =  (J  A'B> 

«.J)€X 

where  A'B^  =  A'  3  BJ, 

A’  =  A  ®  A  ©  ...  ©  A  =  {(»,  0)}  if  i  >  0, 

i 

A*  s  A-1  ®  A~l  ©  ...  ®  A~\  ={(i,0)}  if  i  <  0, 

—  t 

and  A,  B ,  A~x ,  5-1  are  the  elementary  images  defined  in  Definition  2.7. 

Proof  Appendix  C. 

Principle  1.  Fundamental  Principle  of  Image  Transformations 
Any  image  transformation  T  can  be  implemented  by  using  appropriate  reference 
images  R  and  the  three  fundamental  operations:  1.  Complement  X  of  an  image  X , 

2.  Union  u  of  two  images,  3.  Dilation  ©  of  two  images. 

Proof.  It  follows  from  Theorem  2.1. 


In  order  to  use  principle  1  efficiently  in  practice,  we  invoke  principle  2  for  the  generation  of 
reference  images. 

Principle  2.  Fundamental  Principle  of  Reference  Images 
Any  reference  image  R  can  be  generated  from  elementary  images  (/,  A,  A~l,B,  B~x) 
by  using  the  three  fundamental  operations. 

Proof.  It  follows  from  Theorem  2.2. 

Therefore,  by  the  above  principles,  we  can  represent  BIA  as: 

BIA  =  (P(W);  ©,  U,  A,  A~l,B,  B~l). 
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3  Development  of  Binary  Image  Algebra  (BIA) 

BIA  can  have  many  applications  in  character  recognition,  industrial  inspection,  medical  image 
processing,  and  scientific  computation.  In  this  section  we  first  review  the  basic  properties  of 
images  and  image  transformations,  define  11  standard  operations,  and  give  some  special  cases 
of  dilation  [2]-[5]  [33]-[36j.  Then  we  summarize  5  theorems  for  binary  image  processing. 

This  section  is  primarily  a  survey  of  binary  image  processing  algorithms  with  implementation 
using  BIA  fundamental  operations.  These  fundamental  operations  are  so  chosen  because  they 
form  an  efficient  basis  for  the  instruction  set  of  an  optically-based  cellular  image  processor.  This 
survey  serves  as  a  description  of  a  parallel  language  for  controlling  the  processor  and  how  it 
is  compiled  into  low  level  instructions.  The  use  of  BIA  for  parallel  numerical  computation  is 
described  in  [19]. 


3.1  Basic  Properties  of  Images  and  Image  Transformations 
Definition  3.1  Connectivity  in  Images 

1.  4-neighbor  and  8-neighbor: 

An  image  point  (x,y)  in  an  image  X  can  have  two  types  of  neighors: 

(a)  An  image  point  (t,  j)  is  a  4-neighbor  of  (x,y) 

~  0\j)  €  {(x±l,y),(x,yil)}. 

Remark:  {(x,  y),  (x  ±  1,  y),  (x,  y  ±  1)}  is  called  the  4- neighborhood  of  (x,  y)  and  X*  s 
{(0, 0), (0, ±1), (il, 0)}  =  IV  AU  A^uBur1  (Fig.  5(a)). 

(b)  An  image  point  ( i,j )  is  a  8-neighbor  of  (x,y) 

—  (*,j)  €  {(x±  l,y),(x,y±  l),(x±  l,y  ±  1)}. 

Remark:  {(x,y),(x  ±  l,y),(x,y  ±  l),(x  ±  l,y  ±  1)}  is  called  the  8-neighborhood  of 
(x,y)  and  Na  s  {(0,0),(0,±1),(±1,0),(±1,±1)}  (Fig.  5(b)). 

2.  4-connected  and  8-connected: 

(a)  Two  image  points  (x,y)  and  (i,j)  of  an  image  X  are  4-connected 

♦-»  there  exists  a  sequence  of  image  points  (x,y)  =  (xo- yo),(xi,  yi), ...  ,(xn, Vm)  ~ 
(i,  j),  where  (xfc,yn)  is  a  4-neighbor  of  ( Xk-i,yk-i )  and  (xfc,yk)  6  X,  l  <  k  <  m. 

(b)  Two  image  points  (x,y)  and  ( i,j )  of  an  image  X  are  8-connected 

*-*  there  exists  a  sequence  of  image  points  (x,y)  =  (xo,  yo),(xi,  yi ),...,  (xm,ym)  = 
where  (x*,y*)  is  a  8-neighbor  of  (x*_i,y*_i)  and  (xk,y*)  €  X,  1  <  k  <  m. 

Remark  1:  “4-connected  in  X ”  and  “8-connected  in  X ”  are  equivalence  relations  (reflexive, 
symmetric  and  transitive). 

Remark  2:  For  any  image  point  (x,  y)  in  a  non-null  image  X,  the  set  of  (i,j)  such  that 
(x,y)  and  (i,j)  are  4-connected  (or  8-connected)  is  called  a  4-connected  (or  8  connected) 
component  of  X.  A  4-connected  (or  8-connected)  component  of  A  is  just  an  equivalence 
class  in  X  under  the  equivalence  relation  —  “4-connected  (or  8-connected)  in  X .  Thus, 
a  collection  of  4-connected  (or  8-connected)  component  of  X  forms  a  partition  of  X ,  i.e. 
the  set  of  all  4-connected  (or  8-connected)  components  {At},g/  (where  I  is  the  index  set 
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of  connected  components)  is  a  family  of  non-nuil  subimages  of  X  and  has  the  following 
properties: 

(a)  Xi  ^  <p  for  all  i  €  I- 

(b)  Xi  n  Xj  =  ®  for  all  i  j,  »,;€/.  (X,  n  X,  =  1T%uT^  as  denned  in  Definition  3.31 

(c)  X  =  U,€<X„ 

Fig.  8(a)  shows  a  4-connected  component  in  an  image  X  and  Fig.  6(b)  5  ,'ws  an  3- 
connected  component  in  X. 

Remark  3:  If  an  image  X  has  l  -4-connected  (or  3-connected)  components,  there  are  / 
distinct  equivalence  classes  in  X.  Each  equivalence  class  Xt  can  be  represented  by  an 
image  point  in  X,.  Thus,  we  may  use  l  distinct  image  points  which  belong  to  l  different 
4-connected  (or  3-connected)  components  to  represent  the  classes  of  the  image  X. 

Remark  4:  In  dealing  with  connectedness  in  both  X  and  X,  to  avoid  the  “connectivity 
paradox”  [33],  it  is  preferable  to  use  opposite  types  of  connectedness  for  X  and  X,  i.e.  if 
we  use  “4-connected"  for  X,  then  we  use  “3-connected”  for  X,  and  vice  versa. 

Remark  5:  If  any  image  X  is  surrounded  by  a  border  of  0’s,  the  component  of  X  consisting 
of  the  points  connected  to  (any  one  of)  these  0’s  is  called  the  outside  of  X  (Fig.  7(a)).  If 
X  has  any  other  components,  they  are  called  holes  in  X  (Fig.  7(b)). 

For  more  detailed  discussion  of  geometric  properties  of  images,  the  reader  is  referred  to 
[33]-[35].  For  equivalence  relations,  equivalence  classes  and  partitions,  please  refer  to  [30]-(32]. 

Definition  3.2  Basic  Properties  in  Image  Transformations 

The  key  properties  of  image  transformations  are  the  following  ten  basic  properties 

1.  Increasing:  An  image  transformation  T{X)  is  increasing 

-  (X  C  Y  -  T(X)  C  T{Y))  for  all  X,  Y  €  P(W). 

2.  Decreasing:  An  image  transformation  T(X)  is  decreasing 

-  {X  C  Y  -  T(Y)  C  T{X))  for  all  X,  Y  €  P{W). 

3.  Extensive:  An  image  transformation  T(X)  is  extensive 

-  X  C  T(X)  for  ad  X  €  P{W). 

4.  Antiextensive:  An  image  transformation  T{X)  is  antiextensive 
«  T{X)  C  X  for  all  X  6  P(W). 

3.  Idempotent:  An  image  transformation  T(X)  is  idempotent 
«  T(T{X))  =  T(X)  for  aU  X  6  P(W). 

6.  Shift  invariant:  An  image  transformation  T(X)  is  shift  invariant 

«■*  T(X  ©  P)  =  T(X)  ©  P  for  all  X,  P  6  P(W)  and  P  is  a  point  image  which  consists  of 
one  and  only  one  image  point. 

If  an  image  transformation  is  not  shift  invariant,  then  it  is  shift  variant: 

T(X  ©  P)  £  T(X)  ©  P  (in  general). 
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T.  Homotopic:  An  image  transformation  T{X)  is  homotopic 

-  there  exists  a  one-to-one  and  onto  correspondence  between  the  connected  components 
of  X  and  those  of  T’(X),  for  ail  X  S  P(W).  The  same  is  then  true  for  the  holes. 

3.  Commutative:  A  binary  image  operation  •  is  commutative 

-  X  ■  R  =  R  ■  X  for  all  X,R  6  P{W). 

9.  Assoriative:  A  binary  image  operation  •  is  commutative 

-  [X  ■  R)  ■  Q  =  X  ■  (R  ■  Q)  for  all  X,  £,  Q  e  P{W). 

10.  Distributive:  A  binary  image  operation  •  is  distributive  over  a  binarv  image  operation  — 

-  X  ■  {R  +  Q)  =  {X  •  R)  +  (X  ■  Q)  for  all  X,  R,Q  6  P(W). 


Definition  3.3  Standard  Operations 

Most  standard  operations  can  be  derived  from  the  three  fundamental  operations;  eleven  common 
ones  follow: 

1.  Difference  of  X  by  R  (Fig.  3(a)): 


X/R  =  {(x,y)  €  X  |  (x,y)  g  R}  =  Xn  J  =  Zu  R 

Remark:  fT  —  Wf  X  where  W  is  the  universal  image.  The  difference  is  an  obvious  approach 
to  detect  defects  in  the  foreground  of  a  tested  image. 

2.  Intersection  of  two  images  X  and  R  (Fig.  8(b)): 


Xn  R  =  {(x, y)  |  ( x , y)  6  X  a  (x, y)  €  £}  =  Xu  H 

Remark:  X  U  R  —  YcTH.  If  X  n  R  jk  <f>,  then  we  say  that  an  image  X  hits  (or  is  joint 
with)  an  image  R.  If  X  n  R  =  <£,  then  we  say  that  an  image  X  misses  (or  is  disjoint  with) 
an  image  R 

3.  Erosion  of  an  image  X  by  a  reference  image  R  or  foreground  template  matching  of  X  by 
R  (Fig.  8(c)): 


XQR=J®R 


Remark:  X  ©  R  =  X  0  R,  and  R  =  R  when  R  is  symmetic.  The  erosion  of  an  image  X  by 
a  reference  image  R  can  be  thought  of  as  the  complement  of  the  dilation  of  the  backgound 
by  the  reflection  of  the  reference  image  R.  In  general,  the  erosion  of  a  non-null  image 
X  by  a  non-null  reference  image  R  can  be  used  to  decrease  the  size  of  regions,  increase 
the  size  of  holes,  eliminate  regions,  and  break  bridges  in  X;  on  the  contrary,  the  dilation 
of  a  non-null  image  X  by  a  non-null  reference  image  R  can  increase  the  size  of  regions, 
decrease  or  fill  in  holes  and  cavities,  and  bridge  gaps  in  X.  Furthermore,  the  erosion  can 
be  interpreted  as  a  foreground  template  matching  where  the  foreground  points  of  X  ©  R 
indicates  the  ocurrences  of  the  foreground  template  R  in  X  (in  this  purpose,  the  size  of  R 
usually  is  much  smaller  than  the  size  of  X). 
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4.  Symmetric  Difference  of  two  images  (mod  2  image  addition  or  subtraction)  (Fig.  3(d)): 


X  A  R  =  (XI R)  u  (R/X)  =  Xu  R  u  lul 

Remark:  The  symmetric  difference  is  a  commutative  operation,  and  its  inverse  operation 
can  be  defined  as  itself.  In  section  4  we  show  that  this  operation  is  the  parallel  form 
of  boolean  EXCLUSIVE-OR.  It  is  an  obvious  approach  to  detect  defects  (including  the 
foreground  or  background  defects)  of  a  tested  image. 

5.  Opening  of  an  image  X  by  a  reference  image  R  (Fig.  3(e)): 

Xo  R  =  (XQR)SR  =  Til  a  R 

Remark:  The  opening  operation  is  an  erosion  followed  by  a  dilaton  with  the  same  reference 
image  R.  In  general,  the  opening  X  o  R  with  a  non-null  reference  image  R  reduces  the  sine 
of  regions  and  eliminates  some  image  points  by  removing  all  features  in  X  which  can  not 
contain  the  reference  image  R. 

6.  Closing  of  an  image  X  by  a  reference  image  R  (Fig.  3(f)): 


X*R  =  (X$R)QR  =  (X®R)$R 

Remark:  The  closing  operation  is  a  dilation  followed  by  an  erosion  with  the  same  reference 
image  R.  In  general,  the  closing  X  •  R  with  a  non-null  reference  image  R  increases  the  size 
of  regions  and  eliminates  some  background  points  by  filling  in  all  background  areas  that 
can  not  contain  the  reference  image  R ,  snch  as  holes  and  concavities  in  the  image  X. 

7.  Hit  or  miss  transform  ®  of  an  image  X  by  an  image  pair  R  =  (Ri,R2)  or  template 
matching  of  X  by  R  (Fig.  8(g)): 


X  ®R  =  (X  Q  R\)  n  (X  0  R2)  =  (X®  R\)  u  [X  ©  R2) 

Remark:  The  hit  or  miss  transform  of  an  image  X  by  a  reference  image  pair  R  =  ( R\ ,  R2 ) 
is  used  to  match  the  shape  (or  template)  defined  by  the  reference  image  pair  R  where 
defines  the  foreground  of  the  shape  and  R2  defines  the  background  of  the  shape.  The  key 
conditions  are  that  the  foreground  X  must  match  R\  (i.e.  X$R\),  while  simutaneously  the 
background  X  matches  R2  (i.e.  ~XqR2).  In  order  to  better  define  the  hit  or  miss  transform 
and  its  relationship  with  conventional  boolean  logic  operations,  we  start  from  a  pixel-wise 
boolean  comparison  to  derive  the  hit  or  miss  transform  in  shape  recognition  (Theorem 
3.4).  Note  the  similarity  of  the  symmetric  difference  and  the  hit  or  miss  transform. 

3.  Thinning  ®  an  image  X  by  an  image  pair  R  =  [R\,R2)  (Fig.  8(h)): 

X(o)R  =  X/(X  ®  R)  =  Xu(X®  Ri)u(X®  R2) 

Remark:  The  thinning  operation  is  anciextensive  and  decreases  the  size  by  removing  the 
central  points  of  the  regions  which  match  the  reference  image  pair  R  =  (R\,R2). 
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9.  Thickening  O  image  X  by  an  image  pair  R  =  (R2,R2)  (Fig.  3(i)): 

X  QR  =  Xu(XQR)  =  Xu(T&Ri)u(X&R2) 

Remark:  The  thickening  operation  is  extensive  and  increases  the  size  by  filling  the  image 
points  where  the  regions  match  the  reference  image  pair  R  =  (R2,R2). 

10.  Sequential  operations  (e.g.  sequential  dilation,  sequential  erosion,  sequential  thinning  etc.): 
If  an  image  operation  •  is  successively  performed  with  each  reference  image  (or  image  pairs) 
in  a  sequence  ( R e)  =  (Ra,Rb, ....  R:),  then  we  define  a  sequential  image  operation 

X-(Rg)  =  (...((X-Ra)-Ri)...'-R,). 


Two  examples  are: 

(a)  Sequential  thinning  of  an  image  X  by  a  sequence  of  image  pairs  (Re)  s  (ft,,  Rb, ...,  Rz): 

X@(R4)  =  (...((X(5)Ra)(3)Ri)...@Rt). 

Remark:  The  sequential  thinning  is  powerful  in  many  applications,  such  as  construct¬ 
ing  a  digital  homo  topic  skeleton  of  an  image  X.  Skeletonization  of  an  image  is  an 
operation  that  transforms  the  image  to  a  simplified  image,  called  skeleton,  which 
emphasizes  its  connectivity.  However,  a  homotopic  skeleton  cannot  be  obtained  by 
digitizing  an  analog  skeletonization  algorithm;  instead,  a  sequential  thinning  with  a 
sequence  of  reference  image  pairs  should  be  used.  Several  different  algorithms  em¬ 
ploying  different  reference  image  pairs  (called  masks)  have  been  proposed  by  several 
authors  [6][36].  Fig.  8(j)  shows  an  example  of  the  skeletonization  by  a  sequential 
thinning  with  a  sequence  of  eight  reference  image  pairs  proposed  by  Levialdi  et  al 

[36]. 

(b)  Sequential  dilation  of  an  image  X  and  a  sequence  of  reference  images  (Re)  s  (£„,  A,, ...,  Rt 

XQ(Re)  =  (...((X  ©  AO  ©  Rb)-  ©  A,)- 

Remark:  Since  the  dilation  is  commutative  and  associative,  in  practice  the  dilation 
X  ©  R  with  a  large  reference  image  R  is  usually  implemented  as  a  sequential  dilation 
with  a  sequence  of  small  reference  images.  For  example,  if  R  =  E\  ©  E2  ©  ...  0  Ei,  , 
then  • 

(...((*  ©  Ei)  ©  £2)  ©  -  ©  £*); 

and  if  E  =  E\  =  E2  =  ...  =  £*,  then 

R=  Ek  =  £®E  ©  -  ©  E. 

>  ■  1  ■■  y  ■ 

k 

11.  Conditional  operations  (e.g.  conditional  dilation,  conditional  erosion,  conditional  thinning 
etc.): 

An  image  operation  ■  between  an  image  X  and  a  reference  image  (or  image  pairs)  R 
performed  within  a  limiting  set  Y  is  called  a  conditional  operation  and  is  denoted 

X  ■  R  |  Y  =  [X  ■  R)  n  7  =  X-RliY 

Remark:  Fig.  8(k)  gives  an  example  of  the  conditional  dilation.  . 
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3.2  Examples  of  Special  Cases:  Translation  (Shifting),  Expansion,  Shrinking, 
and  Projection 

Translation  (shifting),  expansion,  shrinking  and  projection  in  a  direction  can  by  achieved  by  the 
dilation  (or  erosion)  in  a  direct  way. 

1.  Shifting  an  image  X  from  coordinate  lx,  y)  to  coordinate  (x  +•  i,  y  +  j)  is  done  by 

{(*,;)}  =  - re  {(-«,-;)}. 

Remark:  A  point  image  {(:,;)}  corresponds  to  a  discrete  delta  function  at  5(x  -  i,y  -  j). 
Thus,  an  image  function  X(x,y)  (which  corresponds  to  the  image  X)  convolved  with  the 
delta  function  <S(x  -  i,  y  -  j)  or  correlated  with  S(x  -r  t,  y  +  j)  is  the  same  as  X  ®  {(t,  j)}  = 

XQ  {(-«,-;)}• 

2.  Adding  a  new  3-connected  or  4-connected  boundary  to  an  image  X  (i.e.  expansion)  is  done 
by 

X  3  .V4  or  X  3  ,Va 

where  iV 4  =  lu  A  U  A-1  u  B  u  B~l  and  jV8  =  A'B3. 

3.  Removing  the  8-connected  or  4-connected  boundary  of  an  image  X  (i.e.  shrinking)  is  done 
by 

XQX<  =  TTnI  or  X  e  =  TWNs 
where  3  I U  A  U  A~l  l>  B  U  B~l  and  ,V,  s  A'Bj- 

4.  Projecting  an  image  X  to  distance  fe  in  a  direction  9,  i.e.  producing  a  shadow  of  X  where 
the  furthest  image  point  in  the  shadow  in  the  direction  9  is  at  distance  k  from  the  furthest 
image  point  in  X  in  the  direction  9,  this  can  be  achieved  by 

X®  0* 

where  0  can  be  any  one  of  the  following: 

•  East:  £  =  /uA,  Ek  =  Uto^‘ 

•  South:  5  =  IuB~\  Sk  =  ULo  B" 

•  West:  W  =  III  A-1,  Wk  =  (J^0  A" 

•  North:  X  =  /  U  B,  Nk  =  Ulo  B' 

•  Southeast:  SE  =  /U  AB~\  SkB  =  (J-.o  A'S" 

•  Southwest:  Sw  =  /  U  A_15_1,  =  IjLo  A~'B~' 

•  Northwest:  Mw  =  /U  A"1  B,  =  (J^0  A~'Bi 

•  Northeast:  NB  =  /U  AJ,  iV|  =  (J^o  A’S’' 

•  Horizontal:  H  =  UL-i  A',  Bk  =  A* 

•  Vertical:  V  =  (JL-i  V*  =  U*,_fc  5’ 

•  Left-diagonal:  Id  =  l)L-i  A*’B‘,  Ifc  =  U U-k^'B' 

•  Right-diagonal:  Rd  —  UU-i  A'B',  Rq  —  U«-ie  A’ B' 
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3.3  Theorems  for  Low  Level  Vision 

Here  we  summarize  five  theorems  for  binary  image  processing  applications.  Theorem  3.1  gjv-»s 
basic  properties  of  the  BIA  fundamental  operations  and  standard  operations.  Theorems  3.2- 
3.5  describes  the  implementation  of  morphological  filters,  shape  recognition  algorithms,  '‘salt” 
and  “pepper”  noise  removal,  and  size  and  location  verifications.  Those  more  obvious  proofs  are 
'  omitted  for  brevity.. 

Theorem  3.1  Properties  of  Image  Operations 

The  BIA  fundamental  operations  and  standard  operations  have  the  properties  shown  in  Table 
1(a)  and  Table  1(b). 

Proof.  Appendix  D  gives  some  of  their  mathematical  expressions  which  follow  form  the 
definitions. 

Theorem  3.2  Morphological  Filters 

Many  image  transformations  are  interpreted  as  morphological  filtering  [2]  or  cellular  filtering  [61. 
The  major  mophological  filters  are  listed  in  the  following: 

1.  One  kind  of  morphological  low  pass  filter  (Fig.  9(a)):  to  remove  high  frequencies  in  the 
foreground  of  am  image  X  can  be  achieved  by  opening,  i.e. 


Xo  R  =  {XQR)®R  =  T®RdR. 

2.  A  second  kind  of  morphological  low  pass  filter  (Fig.  9(b)):  to  remove  high  frequencies  in 
the  background  of  an  image  X  can  be  achieved  by  closing,  i.e. 


X  •  R  =  {X  ©  R)  0  R  =  {X  ©  R)  ©  R. 

3.  A  morphological  high  pass  filter  (as  shown  in  Fig.  9(c))  which  removes  low  frequencies  in 
the  foreground  of  an  image  X  can  be  achieved  by  the  difference  of  X  and  its  opening,  i.e. 


x/{x  o  r )  =  x/{(x  ©  R)  ®  R)  =  x/(X  ©  r  ©  R)  =  x  u  (X®  r  ©  r). 

4.  A  morphological  band  pass  filter  (as  shown  in  Fig.  9(d))  which  removes  low  frequencies 
and  high  frequencies  in  the  foreground  of  an  image  X  can  be  achieved  by  the  difference  of 
its  opening  with  a  smaller  reference  image  R  and  its  opening  with  a  larger  reference  image 
Q ,  where  RC  Q,  i.e. 


(XoR)/{XoQ) 


(( XQR)QR)/((XQQ)®Q ) 

((X©T)©fi)/((X©q)©<3) 
(X  0  A  ©  R)  u  (X$Q  ©  Q) 
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Theorem  3.3  Shape  Recognition  ( Template  Matching) 

1.  The  locations  of  a  shape,  that  is  defined  by  a  non-null  reference  image  R  and  a  non- anil 
reference  image  (called  mask)  M  (Fig.  10(a)),  with  R  C  M  C  W  (W  is  the  universal 
image),  can  be  detected  by 


( X  9H)n(l0  (M/R))  =  (X  ©  R)  u  (X  ©  (M/R))  =  (X 0  R)  u  (X  9  M  u  A). 

Equivalently,  setting  =  R,  Ri  =  .Vf/j?  and  redefining  a  non-null  reference  image  pair 
=  (i?i,  i?2)  (Fig.  10(b))  yields  the  hit  or  miss  transform  of  X  by  R: 


X  ®R  =  (X  e  R\)nCZQ  Ri)  =  Rx)u(X  $  A,). 

2.  The  locations  of  a  shape,  that  is  defined  by  a  family  of  non-null  reference  image  pairs 
{!?(#)}  with  9  €  ©  (0  is  the  index  set  of  the  family  of  non-null  reference  image  pairs  and 
i?(0)  =  (Ri(9),R2(9)),  can  be  detected  by  the  union  of  the  hit  or  miss  transform  of  X  by 
R(0): 

U  X®R(8)=  1J  (XeRx(9))n(XQR2(9))=  (J  (X©  Rx(9))  u  (X  0  £,(*)). 

see  see  see 

/’too/:  Appendix  E. 

Theorem  3.4  “Salt’  and  * Pepper *  Noise  Removal 

1.  “Salt”  noise  removal  (isolated  Image  point  removal)  (Fig.  11(a)):  to  remove  an  image  point 
if  its  4-connected  or  8-connected  neighbors  are  background  points  (0’s)  can  be  achieved  by 

XQ  Q*  =  X  u  X  6  .Vf4  or 

XQQa  =  XuI®Mj 

where  Q4  =  (,Vf4,I),  Q&  =  (Afg,/),  Af4  =  AU  A-1  U  5  U  f?'1  =  ,V4//  and  A/g  =  N%/I. 

2.  “Pepper”  noise  removal  (interior  fill)  (Fig.  11(b)):  to  create  an  image  point  at  a  coordinate 
if  its  4-connected  or  8-connected  neighbors  are  image  points  (l’s)  can  be  achieved  by 


X  (5)  H4  =  X  U  X  ©  A4“4  or 
X®R9  =  X  u  X  ©  Mi 


where  R 4  =  (I,  A/4),  R9  =  (I,  A/g). 

3.  “Salt  and  pepper”  noise  removal  (Fig.  11(c)):  to  remove  noise  points,  that  are  completely 
surrounded  with  4-connected  neighbors  or  8-connected  neighbors  of  the  opposite  value,  can 
be  achieved  by 
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Proof:  Appendix  F. 

Remark:  This  theorem  demonstrates  the  fact  that  many  higher  levei  operations  (e.g.  involv¬ 
ing  thinning  and  thickening)  can  be  efficiently  mplemented  by  the  three  fundamental  operations. 
Using  the  same  design  methodology  as  the  ‘‘salt  and  pepper*  noise  removal,  we  can  design  many 
similar  algorithms,  such  as  spur  removal,  bridge  break,  and  edge  detection  (perimeter)  etc.  For 
example,  the  detection  of  the  4-connected  or  3-connected  edge  of  an  image  X  (Fig.  12)  can  be 
achieved  by 


X/(X  8  -V,)  = 
X/(XQN<) 


JJ  .Va)  or 

ru(T©iV4). 


Theorem  3.5  Size  and  Location  Verification 

The  locations  in  an  image  X  of  the  regions  including  the  reference  image  R  and  included  in  the 
reference  image  Q,  where  R  C  Q,  can  be  detected  by 

S((X  0  R)K(X  QQ)®  Q)))  =  5((X®a)u(X©  <?©<?))). 

where  S(-)  means  the  homotopic  skeletonization.  (An  example  is  given  in  Fig.  13.) 

Proof:  Appendix  G. 

The  above  theorems  serve  as  the  typical  roles  for  morphological  image  processing.  In  fact, 
there  are  many  ways  to  analyze  the  shapes  and  sizes  of  an  image  only  using  by  the  three 
fundamental  operations.  As  another  example:  comparing  an  image  X  with  its  convex  hull  C(X) 
[34]  is  a  useful  technique  to  analyze  shape.  If  there  is  only  one  object  or  objects  separated  by 
distances  greater  than  their  own  diameters  in  the  image  X.  then  its  convex  hull  is  the  intersection 
of  projections  (Fig.  14(a)): 

(l(xeef) 

i*l 

« 

where  9,-,  *  =  1,2, 3, 4,  are  H,V,Rq,  Ld  (defined  in  Definition  3.4),  and  k  should  be  greater 
than  the  longest  radius  of  objects  in  X.  Then  the  difference  of  the  convex  hull  and  the  image 
C(X)/X  indicates  how  many  concavities  the  image  X  has  and  what  their  individual  shapes  and 
sizes  are.  Fig.  14(b)  illustrates  an  example. 


4  Relationship  to  Other  Computing  Theories 

4.1  Relationship  to  Boolean  Logic 

BIA  can  implement  any  boolean  logic  operation  on  binary  images.  It  is  also  obvious  that  BIA 
fundamental  operations  can  be  implemented  by  a  boolean  logic  gate  array  with  interconnections. 
The  following  straight-forward  correspondance  can  be  drawn  between  the  BIA  operations  and 
boolean  logic  operations: 


BIA  Operations  Boolean  Logic  Operations 

1.  Complement  NOT  "" 

2.  Union  OR 

3.  Dilation  Multiple-input  OR 

4.  Intersection  AND 

5.  Erosion  Multiple-input  AND 

■  6.  Symmetric  Difference  EXCLUSIVE-OR 

Note  that  the  inputs  of  OR  and  AND  (corresponding  to  union  and  intersection)  come  from  two 
different  images.  The  multiple  inputs  of  OR  and  AND  (corresponding  to  dilation  and  union) 
come  from  the  same  image  while  the  other  operand  image  R  only  determines  the  number  and 
location  of  input  pixel  values.  A  complete  logical  set  is  able  to  implement  any  boolean  logic 
function;  it  consists  of  at  least  one  of  the  following  sets:  NOT  and  OR;  NOT  and  AND;  NAND; 
NOR.  In  BIA.  in  order  to  implement  any  image  transformation,  we  need  a  complete  system  of 
pixel-wise  logic  operations  and  we  also  need  a  translational  type  of  operation  (such  as  translation, 
dilation,  erosion,  convolution  and  correlation  etc.)  to  allow  the  global  information  extraction  in 
an  image  or  the  information  exchange  between  pixels  of  the  same  images.  In  order  to  have  a 
2-D  compact  parallel  form  of  image  processing  algorithms  whc»e  variables  are  whole  images,  we 
define  the  parallel  form  of  those  corresponding  boolean  logic  operations  as  BIA  operations.  In 
fact,  there  are  two  boolean  algebras,  (f>(W);U,n,  ~,<p,W)  and  (P{W)\  A,0,  W ),  supported 
by  BLA  also  (subsection  4.4).  We  can  define  several  possible  sets  of  fundamental  operations  for 
implementing  any  image  transformation,  such  as  a  parallel  form  of  NOR  (or  NAND  or  (NOT 
and  OR)  or  (NOT  and  AND))  and  a  translational-type  operation  (e.g.  translation,  dilation, 
erosion,  convolution,  and  correlation  etc).  The  reason  why  we  choose  the  complement,  the  union 
and  the  dilation  as  the  three  fundamental  operations  is: 

•  Nice  mathematical  properties:  The  dilation  is  commutative,  associative,  and  distributive 
with  the  union;  but  the  erosion  has  no  such  properties. 

•  Simple  hardware  implementation:  These  three  operations  are  easily  implemented  by  the 
2-D  gate  array  and  3-D  interconnection  technique. 

•  Simple  software  design:  These  three  operations  are  inherently  parallel  and  frequently  used 
operations.  Algorithms  can  be  written  as  compact  formulas  which  easily  become  very  effi¬ 
cient  fast  parallel  algorithms  by  simply  applying  the  fundamental  operations  and  removing 
the  data  dependencies. 

Comparing  BIA  with  the  conventional  boolean  expressions  for  logic  functions,  the  major 
advantages  of  BIA  are  summarized  in  the  following: 

•  BIA  operations  are  inherently  parallel,  but  boolean  logic  operations  are  serial. 

•  BIA  operations  include  parallel  information  transferring  capabilities  which  is  missing  in 
boolean  logic  operations. 

•  Algorithms  in  BIA  are  written  as  compact  algebraic  formulas  whose  variables  a_e  whole 
images,  while  a  typical  image  processing  algorithm  is  very  difficult  to  write  in  a  compact 
precise  boolean  logic  expression. 

•  BIA  has  pictorial  physical  meaning,  while  boolean  expressions  provide  little  physical  feeling 
for  parallel  image  processing  algorithms. 


4.2  Relationship  to  Symbolic  Substitution  and  Cellular  Logic 

Symbolic  substitution  is  a.  means  of  performing  parallel  digital  computions  and  can  be  used 
to  implement  boolean  logic,  binary  arithmetic,  cellular  logic  and  Turing  machines  [37][38].  It 
involves  two  steps:  1)  recognizing  all  the  locations  of  a  certain  spatial  pattern  within  the  2- 
D  input  data,  and  2)  substituting  a  new  replacement-pattern  wherever  the  search-pattern  was 
recognized.  BL4  can  be  used  to  realize  a  symbolic  substitution  rule  as  follows: 

(X®R)$Q  =(ZqR1)u(X3  R2)QQ 

where  X  is  the  2-D  input  data,  R  =  (Ri,Rj)  is  the  reference  image  pair  corresponding  to 
the  search-pattern  (R\  and  R2  define  the  foreground  and  the  background  of  the  search-pattern 
respectively),  R  defines  a  reflected  reference  image  given  by  R  =  {(-z,  -y)  |  (z,  y)  €  R},  and  Q  is 
the  reference  image  corresponding  to  the  replacement-pattern.  Thus,  symbolic  substitution  rules 
are  particular  BIA  image  transformations  having  the  above  form;  and  BIA  represents  a  general 
complete  systematic  mathematical  tool  for  formalizing  the  symbolic  substitution  algorithms. 

Cellular  logic  architectures  have  been  briefly  reviewed  in  section  1.  A  cellular  logic  operation 
transforms  an  array  of  data  into  a  new  array  of  data  where  each  element  in  the  new  array  has  a 
value  determined  only  by  the  corresponding  element  in  the  original  array  along  with  the  values 
of  its  neighbors  (Fig.  1).  In  BIA,  an  image  transformation  can  be  writtem  as  a  polynominal 
of  reference  images  (Theorem  2.1)  where  the  reference  images  can  have  arbitrary  large  size.  In 
terms  of  cellular  logic,  the  reference  image  essentially  defines  the  neighborhood  of  a  cell  where 
the  neighborhood  can  be  very  large  and  not  just  nearest  4-  or  8-neighborhood  in  conventional 
cellular  lope.  Thus,  cellular  logic  operations  are  also  particular  cases  of  image  transformations 
with  small  local  reference  images,  and  BIA  also  serve  as  a  systematic  mathematical  tool  for 
formalizing  cellular  logic. 

Because  of  existing  hardware  interconnection  limitations,  it  is  difficult  and  costly  to  imple¬ 
ment  an  image  transformation  with  a  large  reference  image  in  one  clock  cycle.  However,  the 
conventional  nearest-neighbor  connected  cellular  arrays  have  poor  communication  capabilities. 
To  improve  this,  we  develop  the  DOCIP-hypercube  architecure  in  section  5,  which  combines 
features  of  conventional  nearest- neighbor  connected  cellular  logic  architectures  and  conventional 
hypercube  architectures  for  implementing  BIA  effectively. 

In  summary,  BIA  provides  a  systematic  mathematical  formalism  for  both  symbolic  substi¬ 
tution  and  cellular  lope.  The  applications  of  symbolic  substitution  and  cellular  logic  can  be 
accomplished  by  BIA;  on  the  other  hand,  generalized  cellular  logic  architectures  are  good  can¬ 
didates  for  implementing  BIA. 

4.3  Relationship  to  Linear  Shift  Invariant  Systems,  Convolution,  and  Corre¬ 
lation 

It  is  well  known  that  the  theory  of  linear  shift-invariant  (LSI)  systems  plays  a  key  role  in 
conventional  signal  (including  image)  and  system  analysis  [39][40|.  It  is  very  natural  that  we 
like  to  ask  what  the  relation  between  BIA  and  LSI  system  theory  is.  A  system  is  defined  as  a 
transformation  or  mapping  from  a  set  of  input  functions  into  a  set  of  output  functions,  and  a 
two  dimensional  discrete  LSI  system  is  defined  as  a  system  which  obeys  two  properties: 


•  Linearity:  T[ax(i,  j)  +  bz(i,j)]  =  aT[x{i,j)\  +  bT[z{i,j)]  for  arbitary  constants  a  and  b; 


•  Shift-invariance:  y(i,j)  =  T[x(i,j) j  -  y(i  -  k,j  -  l)  =  r[x(i  -  k,j  -  /)]. 

A  linear  system  can  be  completely  characterized  by  its  unit-impulse  response  r(t,  j;  fc,  /)  =  T[(5(t  - 
fc.j-OJ*  la  aa  LSI  system  the  unit-impulse  response  is  simply  r(i,j-,k,l)  =  r(i-k,j~l),  and  the 
output  of  an  LSI  system  with  input  x(i,  j)  and  unit-impulse  response  r(i,  j)  is  the  convolution 
of  x(i,j)  and  r(i,  j),  denoted  by 

OO 

x(ij)*r(ij)s  23  x(fc,  /)r(j  —  k,j  —  /). 

oo 

Now,  let  us  consider  only  binary  images.  In  terms  of  the  set  notation,  an  image  X  =  {(*,  j)  | 
x(t,j)  =  1}  corresponds  to  function  x(»,j).  If  we  assume  r(i,j)  =  1  at  and  only  at  n  points 
which  corresponds  to  an  image  R  with  n  image  points,  then  the  convolution  of  x(i,j)  and  r(i,  j) 
with  a  threshold  t  =  0  is 


X*R\t=Q  =  {(!,;)  |  £*(fc,/)r(»  -k,j-l)>  0} 

*,/ 

*  {(*  +  fc,j  +/)  |  ^x(k,l)r{i,j)  >  0} 

k,l 

=  {(*  +  *J  +  0  I  x(k,l)r(i,j)  >  0} 

*  {(<  +  *,i  +  0  Km)  €*,(*,/)€*} 

=  X©£ 

where  the  ouput  of  the  threshold  is  defined  as  1  if  x(», ;')  *  r(»,  j)  >  0,  and  is  0  otherwise;  and  the 
universal  image,  as  before,  contains  all  image  points  (k,l),  and  (s'  +  fc,  j  +  i).  This  means 
that  the  dilation  X  ©  R  is  the  same  as  adding  a  threshold  t  s  0  to  the  convolution  sum.  The 
reference  image  plays  a  role  similar  to  that  of  the  unit  impiuse  response  in  the  binary  image 
system.  Similarly  the  erosion  X  0  iZ  is  the  same  as  the  convolution  x(*,j)  *  r(— t,  -j)  followed 
by  the  threshold  Is#  — 1. 

Correlators  have  been  used  in  pattern  recognition  for  a  long  time  [41].  Correlation  is  strongly 
related  to  convolution:  convolution  involves  folding,  shifting  and  summing;  correlation  involves  * 
shifting  and  summing  without  folding.  Therefore, 

X  ©  R  —  X  *  R  |tso=  X  o  R  |tao 

X  ©  R  —  X  *  R  |  essn— 1 =  X  OR  |t=n-l 

where  «  means  convolution,  o  means  correlation,  and  R  means  the  reflected  image  of  R. 

Furthermore,  although  the  three  fundamental  operations  of  BTA  are  nonlinear,  with  appropri¬ 
ate  number  representations  they  are  able  to  implement  parallel  numerical  and  linear  operations 
too.  Also,  BIA  can  implement  both  shift  invariant  and  shift  variant  image  transformations. 

4.4  Some  Standard  Algebraic  Structures 

Some  algebraic  structures  supported  by  BIA  are  in  the  following: 


1.  (P(W);©)  is  a  semigroup. 

2.  (P(W);®,/)  is  a  monoid. 

3.  (P(W);  A,<£,  A)  is  an  abeiiau  group. 

4.  (P(W);  u,  n,  ~,<p,  W)  and  (P(W);  A,  n,  ",(£,  W)  are  Boolean  algebras. 

5.  (P(W);C)  is  a  poset  (partially  ordered  set). 

6.  (P(W);  u,  fl,  C)  is  a  complete  lattice. 

Proof:  (1)  A  semigroup  is  a  set  with  an  associative  binary  operation  [30)-{32j.  By  Theorem  3.1, 
the  dilation  3  is  associative  for  all  images  in  P(W). 

(2)  A  monoid  is  a  semigroup  with  an  identity  [30j-[32j.  By  Appendix  D,  the  dilation  has  an 
identity  /  =  {(0,0)}.  Note  that  (P(W);  9)  is  neither  a  semigroup  nor  a  monoid. 

(3)  A  group  is  a  monoid  in  which  every  element  has  an  inverse.  An  abelian  group  is  a  group 
in  which  the  operation  is  commutative  [30j-(32].  By  the  definition  of  symmetric  difference  (mod 
2  image  addition),  it  can  be  easily  verified  that  its  identity  is  <p  and  its  inverse  operation  (mod 
2  image  subtraction)  is  itself. 

(4)  A  boolean  algebra  is  a  set  with  operations  v,  a,  ",0  and  1  satisfying;  1.  avis  iva, 
a  A  6  =  b  A  a  (commutativity);  2.  a  V  (6  A  c)  =  (a  V  b)  A  (a  v  c),  a  a  (6  v  c)  =  (a  A  b)  v  (a  A  c) 
(associativity);  3.  avO  =  a  (universal  bound);  4.  aAl  =  a  (universal  bound);  5,  avff  =  1,  aAff  =  0 
(complementarity)  [30]-{32].  By  Appendix  D,  (P(W);U,n,_,^,W)  and  (P(W);  A,n,  ~,<j>,  W) 
are  Boolean  algebras. 

(5)  A  poset  is  a  set  with  a  relation  satisfying;  1.  the  reflexivity;  2.  the  antisymmetry;  and 
3.  the  transitivity  [30j-[32j.  The  relation  C  satisfies  these  three  conditions:  1.  X  C  X  for  all 
X  €  P{W );  2.  if  X  C  R  and  R  C  X,  then  X  =  R;  and  3.  if  X  C  R  and  R  C  Q,  then  X  c  <3- 

(6)  A  complete  lattice  is  a  poset  (5;  <)  in  which  every  subset  of  5  has  a  sup  (the  least  upper 
bound)  and  an  inf  (the  greatest  lower  bound)  [30]-[32].  In  the  algebra  (P(W);U,n,  C),  given 
any  subset  of  P(W ),  say  {X(0)  |  9  €  9}  (©  is  the  index  set  of  the  elements  in  this  subset  of 
P(W)),  we  have 

aup=  (J  X(9) 

inf  =  fj  X{9).  . 

fee 

Thus,  several  standard  algebraic  structures  and  their  properties  can  be  directly  implemented 
and  used  in  BIA. 


5  Implementation  on  Optical  Cellular  Logic  Processors 

To  map  algorithms  into  architectures,  we  first  use  an  algebraic  approach  for  describing  a  cellular 
image  processor.  Then  we  design  the  digital  optical  cellular  image  processors  (DOCIPs)  and 
their  optical  implementation.  Figures  15  and  16  show  an  optic.,  concept  for  the  DOCIP  imple¬ 
mentation.  The  optical  system  can  realize  an  array  of  cells  by  a  spatially  parallel  2-D  array  of 
optical  binary  gates  and  performs  interconnections  of  these  gates  by  an  optical  hologram.  The 
DOCIPs  are; 
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•  The  DOCIP -array  (Fig.  15),  a  cellular  array  processor,  uses  optical  parallelism  to  map  an 
inherently  2-D  parallel  image  data  structure  to  a  2-D  nearest- neighbor  connected  cellular 
computer  in  a  simple  and  direct  way.  Its  performance  is  primarily  limited  by  its  0(1) 
interconnectivity. 

•  The  DOCIP -hypercube  (Fig.  16),  a  two-dimensional  cellular  hypercube,  uses  optical  par¬ 
allelism  and  the  3-D  global  interconnection  capabilities  of  optics  to  implement  a  hypercube 
interconnection  mechanism. 

Here,  the  two-dimensional  cellular  hypercube  is  used  to  match  the  structure  of  a  two- 
dimensional  image  and  further  improve  the  communication  ability  of  a  cellular  array.  Ideally, 
a  conventional  hypercube  (Fig.  IT)  increases  the  interconnectivity  to  0{logN )  for  .V  compu¬ 
tation  cells;  however,  when  laid  out  in  two-dimensional  space,  its  interconnection  patterns  are 
not  space  invariant;  such  spatial  invariance  is  desirable  for  image  processing  and  for  simple  im¬ 
plementation  in  optical  hardware.  To  include  this,  we  increase  the  interconnections  to  make  a 
two  dimensional  cellular  hypercube  (Fig.  18).  The  cellular  hypercube  introduces  a  symmetrical 
positive  and  negative  index  so  that  each  cell  is  connected  with  ceils  having  a  relative  one  bit 
difference  in  coordinate  label  in  positive  or  negative  i  and  y  directions;  the  numerical  difference 
of  addresses  of  connected  cells  is  nonzero  in  at  most  1  bit  [42], 

5.1  Algebraic  Description 

Having  defined  cellular  automata  and  the  implementation  requirements  of  BIA,  we  describe  the 
DOCIP  in  an  algebraic  way: 

Definition  of  Cellular  Automata 

A  cellular  automaton  is  an  algebra  A  —  (5;  F,  Nc)  where  5  is  the  state  space  which 
is  a  set  of  states,  F  is  a  family  of  transition  functions,  and  Nc  is  the  neighborhood 
configuration. 

Constraints  of  Implementing  BIA: 

l.SD  P{W) 

2.  F  D 

3.  Ne  D  i  U  AU  A”1  U  B  U  B~l  (or  Ne  D  Au  A~l  U  B  U  B~l) 
where  “D"  means  “contains”. 

Thus,  in  terms  of  cellular  automata,  the  DOCIPs  have  to  satisfy  the  above  constraints  for 
realizing  BIA.  For  storing  input  images  and  temporary  results  in  a  more  flexible  way,  the  DOCIPs 
utilize  three  memory  modules  and  share  the  same  algebraic  structure  (except  the  neighborhood 
configuration): 

DOCIP  =  (P(W  x  W  x  V7);®,U,-,;Ve) 

where  “x”  denotes  cross  product  and  Ne  can  be  one  of  the  following  4  types: 

1.  DOCIP-array4:  each  cell  connects  with  its  four  nearest  neighbors  and  itself,  i.e. 

Ncmn  ~  lu  A  U  A~l  U  B  U  B~l. 
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2.  D0CIP-array8:  each  cell  connects  with  its  eight  nearest  neighbors  and  itself,  i.e. 

^arrays  =  U  A'B 

3.  DOCIP -hypercube4:  each  cell  connects  with  those  cells  in  the  4  directions  at  distances 
1,2, 4, 3,...,  2*  from  itself,  i.e. 

^hypercubck  ~  (A*  U  S') 

■=o.±l.±2 . ±2* 

where  k  is  sufficiently  large  for  the  connections  to  traverse  the  entire  array  of  cells. 

4.  DOCIP-hypercubed:  each  cell  connects  with  those  cells  in  the  8  directions  at  distances 
1,2, 4, 3,...,  2*  from  itself,  i.e. 

^Kypercubea  =  {J  (A*  U  Bi  U  A'B'  U  A'B~') 

i=oil,±2 . ±2* 

5.2  General  Description 

From  the  above  algebraic  description,  the  DOCIPs  have  the  same  algebraic  structure  and  differ 
only  in  their  neighborhood  configurations  iYe.  Thus,  they  share  the  3ame  architecture  as  shown 
in  Fig.  19,  but  have  different  configurations  of  the  reference  images  E,  depending  on  the  optical 
interconnection  network  which  defines  the  neighborhood.  In  practical  applications,  a  larger 
reference  image  R  can  be  generated  from  a  set  of  smaller  reference  image(s)  E%  by  a  “sequential 
dilation”.  If  it  is  possible  to  decompose  R  into  a  sequence  R  »  E\  0  Ei  ©  ...  ©  Ey,  then 

X  ©  R  =  (—((X  ©  Ei)  ©  Ei)  ©  ©  Ei,). 

This  decomposition  may  not  exist,  in  which  case  R  can  always  be  decomposed  as  R  =  R\  u  Ri  U 
...  u  Ry,  and  then 


X  ©  R  =  (X  ©  Rx)  U  (X  ©  R2)  u  ...  u  (X  ©  Ry) 

where  each  R:  can  be  composed  from  the  smaller  reference  images  £,. 

Basically,  the  proposed  DOCIP  as  shown  in  Fig.  19  is  a  cellular  SIMD  machine  and  consists 
of  an  array  of  cells  or  processing  elements  (PEs)  under  the  supervision  of  a  control  unit.  The 
control  unit  includes  a  clock,  a  program  counter,  a  test  and  branch  module  for  feedback  control, 
and  an  instruction  decoder  for  storing  instructions  and  decoding  them  to  supervise  cells.  The 
array  of  cells  includes  a  1  x  3  x  IVs  bit  destination  selector,  three  N  x  N  x  1  bit  memories  for 
storing  images,  a  memory  selector,  and  a  dilation  unit. 

The  DOCIP  shown  in  Fig.  19  operates  as  follows:  (1)  a  binary  image  (N  x  N  matrix)  is 
selected  by  the  destination  selector  and  then  stored  in  any  memory  as  the  instruction  specifies; 

(2)  after  storing  the  images  (1  to  3  N  x  N  matrices),  these  images  and  their  complemented 
versions  are  piped  into  the  next  stage,  which  forms  the  union  of  any  combination  of  images; 

(3)  the  result  is  sent  to  a  dilation  where  the  reference  image  specified  by  the  instruction  is  used 
to  control  the  type  of  dilation;  (4)  finally,  the  dilated  image  can  be  output,  tested  for  program 
control,  or  fed  back  to  step  (1)  by  the  address  field  of  the  instruction. 


78 


The  entire  system  can  be  realized  by  an  optical  gate  array  with  optical  3-D  interconnections 
[25]-{28j.  It  should  be  noted  that  current  optical  technology  has  implemented  only  arrays  of 
moderately  large  numbers  of  gates  (500  x  500)  at  very  slow  (~ms)  switching  speeds,  and  alter¬ 
natively,  arrays  of  small  numbers  of  gates  (2  x  2  to  6  x  6)  at  fast  switching  speeds  (O.lja  -  50ps) 
[43][44].  Current  ongoing  research  in  a  number  of  laboratories  looks  promising  in  eventually 
providing  the  needed  arrays  of  large  numbers  of  gates  with  reasonably  fast  switching  speeds. 
Alternatively,  control  of  the  DOCIP  can  be  easily  realized  by  using  an  electronic  host  instead  of 
the  optical  control  unit,  since  control  of  SIMD  systems  is  primarily  a  serial  process.  The  tradeoff 
is  a  possible  inefficiency  in  the  interfaces  between  electronic  and  optical  units.  Because  of  this, 
the  all-optical  approach  may  be  preferable  in  the  long  term.  To  efficiently  utilize  optical  gates, 
they  can  be  interconnected  with  a  2-D  optical  multiplexing  technique  in  which  a  common  con¬ 
trollable  mask  is  used  for  all  cells.  The  optical  multiplexing  technique  has  following  advantages: 
1)  the  DOCIP  will  no  longer  require  the  broadcasting  of  instructions  from  the  control  unit  — 
instead  all  cells  fan  their  outputs  into  a  commmon  controlling  mask  pixel;  2)  it  will  reduce  the 
number  of  gates;  and  3)  each  cell  has  a  simple  structure  —  essentially  containing  only  a  3-bit 
memory  with  inverting  and  non-inverting  outputs,  and  a  multiple-input  OR  gate  for  dilation. 

To  avoid  the  well-known  drawbacb  of  conventional  computers  based  on  von  Neumann  princi¬ 
ples  [23][38],  the  machine  in  Fig.  19  has  one  instruction  which  implements  the  three  fundamental 
operations  of  BIA  along  with  fetch  and  store.  This  design  uses  the  parallelism  of  optics  to  si¬ 
multaneously  execute  instructions  involving  all  N 2  picture  elements. 

This  single  instruction  has  the  following  format: 

(3i,sz,...,s6,n1,n5,...,nfc,di,d2,d3,;x,j2,a1,a2,...,ai,61,b2,...,6|) 

where  k  is  determined  by  the  chosen  neighborhood  configuration  Ne.  The  DOCIP-array  requires 
k  s  5  or  k  =  9  bits  for  controlling  reference  image  R  at  a  clock  cycle  and  the  DOCIP-hypercube 
requires  k  =  O(logN)  for  N  cells,  and  l  defines  the  maximum  length  of  a  program:  2i.  The 
functions  of  these  ll  +  k +  21  instruction  codes  are  the  following: 

•  si,S2,...,s«  are  used  to  select  the  output  from  the  memory  elements; 

•  ni,  n-i, ...,  rat  are  used  to  control  the  neighborhood  mask,  i.e.  to  supply  the  reference  image; 

•  i\,  di,  and  d3  are  used  to  select  the  destination  memory  for  storing  the  image; 

•  j\  and  ji  are  used  to  flag  an  absolute  jump  or  conditional  jump; 

•  ax,  aj,  are  the  address  for  jump;  and 

•  are  the  address  of  the  instruction. 

Order  of  magnitude  execution  times  for  image  processing  on  the  DOCIP  machines  and  on  the 
conventional-array  processors  are  compared  in  Table  2.  In  contrast  with  the  DOCIP-array,  the 
DOCIP-hypercube  increases  the  interconnection  complexity  to  0(logN),  but  is  able  to  perform 
many  global  operations  in  O(logN)  time.  Comparing  with  the  conventional-array  processors 
having  serial  or  N-parallei  input/output,  the  DOCIP-array  will  have  the  same  order  of  perfor¬ 
mance  in  local  and  global  operations  but  will  be  improved  in  input/output  performance,  and  in 
principle  <■  uld  be  as  low  as  0(1)  in  I/O  operations.  The  DOCIP-hypercube  will  not  only  be 
improved  lu  input/output  performances  but  also  in  global  operations.  With  external  memory,  it 


79 


f 


can  also  be  demonstrated  to  be  general  purpose  in  the  sense  of  the  ability  of  simulating  any  Tur¬ 
ing  machine.  One  important  feature  in  the  design  of  the  DOCIP-array  and  DOCIP-hypercube 
is  that  optical  3-D  free  interconnection  capabilities  can  be  used  to  reduce  the  cell  hardware  re¬ 
quirements  as  well  as  solve  the  global  connection  and  I/O  problems  which  are  difficult  to  solve 
by  planar  VLSI  technology. 


6  A  Programming  Example  —  Size  Verification 

BIA  and  DOCIP  architectures  can  have  many  applications  in  character  recognition,  industrial 
inspection,  medical  and  scientific  research.  Since  BIA  is  able  to  implement  morphological  op¬ 
erations  efficiently,  the  DOCIP  machines  can  efficiently  analyze  the  shape  and  connectivity  of 
regions  as  well  as  measure  their  size;  they  also  have  the  potential  to  accomplish  any  image 
transformation.  Here  we  illustrate  the  programming  of  the  DOCIP  machines  by  a  simple  size 
verification  algorithm: 

•  Problem:  Given  an  input  image  X  with  31  x  31  pixels  (Fig.  20)  which  contains  some 
square  objects  X„  we  want  to  preserve  those  square  objects  X,  which  satisfy  the  following 
condition: 


size  of  R  <  size  of  Xt  <  size  of  Q 

where  R  and  Q  are  reference  images  as  shown  in  Fig.  21.  Other  objects  will  be  eliminated 
in  the  output  image  Y.  The  expected  output  image  Y  is  shown  in  Fig.  22. 

•  Algebraic  expression  for  the  size  verification  using  band  pass  morphological 
filtering  (Theorem  3.2): 


(X®  Re  R)u(?®Q®Q) 

where  R  =  R  and  Q  =  Q  in  this  special  example. 

•  Algorithm  for  the  DOCIP-array8: 


(X©£3©  £3)u(X©  £«©£«) 

where  E  (Fig.  23)  is  the  allowed  reference  image  with  the  maximum  size  at  a  dock 
cyde  in  the  DOCIP-array8,  the  reference  images  R  =  E3-E®E®E  and  Q  =  E*  = 
£©£©£©£=!*©£. 

The  DOCIP-array8  requires  13  steps  to  complete  this  algorithm,  its  program  (instructions) 
is  in  the  following: 

Assume  start  with  X  -*  M\  (X  stored  in  Memoryl) 

1.  37T©  E  —  Mi  (=  X ©  E) 

2.  Af3©£-itf2(=X©£2) 

3.  Mi  ©  E  -*  M7  (=  X  ©  E3) 

4.  Mi  ©  E  — *  M3  (=  X  ©  E *) 

5.  377©  E  —  M3  (=  X©£3  ©  E) 
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6.  M2  ©  £  -  Mt  (=  X©£3  ©  £2) 
r.  jf,  ©  £  -  m2  (=  X©  £3©  £3) 

8.  377©  £  -  Af3  (=  X©£4©  £) 

9.  -Vf3  $  5  -  Af3  (=  X©  £4  ©  £2) 

10.  M3®E-  M3  (=  1 3  E*  3  £3) 

11.  Af3  ®  £  -»  Mi  (=  X 8  £4  0  £4) 

12.  377 u  .v/3  -  .vf3  (=  (Xe£3©£^)u(Xe  £<©£«)) 

13.  End  with  377  -  Y  (=  (X 9  £3  9  £3)  u  (X©  £4  9  £4)) 

•  Algorithm  for  the  DOCLP-hypercubeS: 


(X©P©£0P©£)u(X©P©£2©P©£2) 

where  P  (Fig.  24)  and  £  (Fig.  23)  are  allowed  reference  images  at  a  clock  cycle  in  the 
DOCIP-hypercube8,  the  reference  images  R  =  £3  =  P©£  and  Q  =  £4  =  P©£*2  =  £©£. 

The  DOCIP -hypercub  e8  requires  10  steps  to  complete  this  algorithm,  its  program  (instruc¬ 
tions)  is  shown  in  the  following: 

Assume  start  with  X  —  .V/>  (.Y  stored  in  Memoryl) 

1.  377©  P  -  M2  (=X©P) 

2.  Afa  0  E  ~  Af2  (=  X©  P©  £) 

3.  AT,  ©  £  -  Af3  (=  X©  P  ©  £2) 

4.  377 ©  P  — *  Af2  (=  X©P©  P2 ©  P) 

5.  .Vf2  ©  £  -  Afa  (=  T©P©F  ©  P  ©  £) 

0-  377  ©  P  — *•  Af3  (=  .Y  ©  P  ©  P2  ©  p) 

T.  Jtf3  ©  £  -  M3  (=  X©  P©  P2  ©  P  ©  £) 

8.  ,Vf3  ©  £  -  .Vf3  (=  X©P©£2©P©£2) 

9.  A?7u  Af3  -  Jf3  (s  (X©P©£©P©£)u(X©P©£2©P©£2)) 

10.  End  with  377  -  r  (=  (X9  P©  £  ©  £)  u  (X@P®  P2  ©  P  ©  P2)) 

The  above  programs  can  be  translated  into  the  machine  instruction  codes  directly.  If  we  want 
to  detect  the  geometric  centers  (locations)  of  the  desired  objects,  then  we  can  use  a  sequential 
thinning  to  achieve  the  homotopic  skeleton  (Theorem  3.5)  (Fig.  25). 

7  Conclusions 

We  have  summarized  digital  optical  cellular  image  processing,  including  binary  image  algebra 
(BIA)  and  the  DOCIP  architectures.  BIA  suggests  an  unified  theory  of  parallel  binary  image 
processing  for  developing  parallel  algorithms/languages  and  can  be  generalized  to  grey-level  im¬ 
ages.  Applications  of  BIA  in  binary  image  processing  are  illustrated.  The  DOCIP  architectures, 
especially  the  DOCIP-hypercube,  utilize  the  parallel  communication  and  global  interconnection 
capabilities  of  optics  for  avoiding  communication  bottlenecks  and  matching  BIA  parallel  algo¬ 
rithms  efficiently.  A  size  verification  algorithm  is  used  to  demonstrate  the  programming  of  these 


1-instruction  DOCIP  machines.  Overall,  BIA  is  a  simple,  precise  and  complete  algebraic  the¬ 
ory  of  b'nary  images;  the  DOCIP  machines  have  simple  organization,  low  ceil  complexity  and 
potentially  fast  processing  ability. 
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Appendix  A 

Proof  of  Lemma  2.1: 

We  start  with  the  case  of  X  =  R  and  then  the  case  of  X  £  R. 

Case  1:  X  =  R,  i.e.  R  =  X. 

We  want  to  prove 

(l3i)u(I®I)u7  =  I «  (X®  X)  u  (X  ©  X)  n  I  =  /. 

1.  Claim  /  C  (79,?)U(.YS?)  n  / 

«(0,0)6(I®.t)u(.Y®Y) 

«(0,0)?(7®Jf)u(T®I) 

-  [(0, 0)g(X©  X))  A  [(0,  0)  g  ( X  ©  X))  : 

(a)  Claim  (0,0)  g  (X ©  X): 

Assume  (0,0)  6  (X  ©  X) 

—  (0,0)  €  {(a  +  (-z),4  +  (-y))  6  W  |  (a, 6)  £  X,(-z,-y)  £  X} 

~  (0, 0)  £  {(a  -  x,  4  -  y)  £  W  |  (a,  4)  g  X,  (z,  y)  £  X } 

—  3(a  -  z,  b  -  y)  =  (0, 0)  where  (o,  b)  g  X,  (z,  y)  £  X 
-*  3(z,y)  =  (a, 6)  where  (a,4)  g  ^(z.y)  £  X 

which  is  impossible,  since  (z,y)  =  (a,  b)  g  X  contradicts  with  (z,y)  =  (a,  b)  £  X. 
Therefore,  the  assumption  is  wrong,  we  have  that  (0, 0)  g  (X  ©  X). 

(b)  Claim  (0,0)  g  (X©A> 

Assume  (0,0)  €  ( X  ©  X)  _ 

—  (0,0)  £  {(z  +  (-a),y  +  (-&))  €  W  |  (z,y)  £  X,(-a,-4)  £  X} 

—  (0, 0)  £  {(z  -  a,  y  -  4)  £  W  |  (z,  y)  £  X,  (-a,  -4)  g  X} 

—  (0,0)  £  {(z-a,y-4)£  W|  (z,y)  £  7,(«,6)  g  X) 

—  3(z  -  a,y  -  4)  =  (0,0)  where  (a,  6)  g  X,(z,y)  £  X 

—  3 (z,y)  =  (a,  6)  where  (a,  4)  g  X,(x,y)  €  A 

which  is  impossible,  since  (z,y)  =  (a,  6)  g  X  contradicts  with  (x,y)  —  (a,  4)  £  X. 
Therefore,  the  assumption  is  wrong,  we  have  that  (0,0)  g  (X  ©  X). 

By  (a)  and  (b),  we  have  ((0,0)  g  (X©  X)J  a  [(0,0)  g  (X  ©  X)j,  i.e. 

I C  (X©X)u(X©  A). 

We  also  know  /  C  /,  then  we  have 

/C(X©X)u(X©X)n  /. 
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2.  Claim  (X®  X)  U  (X  0  X)  u  7c  I : 

Since  /  C  /,  it  implies 

(l9.t)u(X9?)uI=(I®^)u  (X®  X)  n  /  c  /. 

From  (1)  and  (2),  we  have  _ 

/c  (X9X)u(xsx)n/ 


(X 9  X)  u  (X  9  X)  u  7=  (X 9  X)  u  (X  0  X)  n  /  C  /. 

Tims,  by  the  equivalence  of  sets,  we  have  (X  9  X)  U  (X  9  X)  n  I  =  I. 

Case  2:  X  #  R ,  i.e.  JZ  #  X. 

We  want  to  prove 

(X  9  A)  u  (X  0  X)  u  7  =  *  ~  (X  0  JZ)  u  (X  9  X)  n  I  =  * 

1.  Claim  /  <t  (Xafl)u(X9ff) 

~  (0,0)  <t  (XeR)  u(xal) 

-  (0, 0)  6  (X  0  JZ)  U  (X  0  X) 

—  (0, 0)  €  (X  0  R)  V  (0, 0)  €  (X  0  X): 

Now  we  assume  (0,0)  g  (X  0  R)  A  (0,0)  g  (X  0  If) : 

(a)  If  (0,0)  £  (X  0  R) 

-  (0,0)  g  {(a  +  k,b  +  1)  |  (a,6)  6  X,(fc,i)  €  R} 

-  (a  +  fc,6  +  0  ¥>  (0^0).  e  V(M)  e  5 

-  (a,  6)  #  (-*,-/),  *(«,*)  *  X,  V(*,Z)  6  A 

-  V(Jb,0  €  JZ,  3(a,  6)  €  X,  (a,  5)  =  (-*,-/) 

-  V(-*,-/)  €  5,  3(a,6)  €  X,  (a, 6)  =  (-*,-0 
-V(i,;)€  JZ,  3(«,*)6X,  («,*)*  (*,i) 

-  ((»,;)  6  A) -*(»,;)  6  X) 

-+RCX. 

(b)  If  (0,0)  £  (X  0  X),  then  X  C  R-  Since  the  dilation  operation  is  commutative,  by 
interchanging  the  variables  X  and  R  and  applying  the  same  procedure  as  (a),  we 
have  X  C  R. 

2.  By  the  above  (a)  and  (b),  we  have  X  =  R  which  contradicts  with  X  ^  R.  Thus,  the 
assumption  is  wrong,  and  we  get  the  following  result: 

~  (0,0)  6  (X 0  JZ)  V  (0,0)  €  (X  0  X) 

~/g  (X0JZ)U(X®X) 

<- (X®fZ)U(X0X)n/  =  ^ 

~  (X  0  X)  u  (X  0  X)  u  7  = 

Hence,  by  case  1.  and  case  2.,  we  have  shown  that 

(Tes)u(rea)ur.{j 
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Appendix  C 

Proof  of  Theorem  2.2:  This  can  be  shown  in  a  very  straight  forward  way.  Any  image  is  a  set 
of  image  points  and  is  the  union  of  point  images  (  consisting  one  and  only  one  image  point).  A 
point  image  {(»,;)}  can  be  written  as 

{(*,;)}  =  A' BP 

Hence,  the  union  of  all  point  images  which  are  contained  in  X  is  the  image  X.  For  example,  an 
image  X  -  {(2,0),(1,  — 1),  (— 1, 2)}  is  denoted  by  , 

X  =  A2UAB~1UA-1B2. 


Appendix  D 

1 .  Properties  of  Complement  and  Difference 

The  complement  a  unary  operation,  is  decreasing  and  shift  variant  (considering  the  outside 
of  an  image).  The  difference  X/R,  a  binary  operation,  is  increasing  (but  decreasing  with  respect 
to  the  reference  image  R),  anti  extensive  with  respect  to  X,  and  shift  variant  (the  reference 
image  R  is  fixed  once  it  is  given).  Note  that  the  difference  operation  is  not  commutative,  not 
associative,  and  not  distributive  over  other  operations.  Furthermore,  the  difference  operation  is 
more  complicated  than  the  complement.  Hence,  it  is  preferable  to  employ  the  complement  as 
a  fundamental  operation,  but  not  the  difference.  The  major  properties  of  the  complement  and 
the  difference  are  listed  in  the  following; 
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1.  X  =  w/x 

2.  X/R  =  XnJ 

3.  ©  =  W 

4.  W  =  <j> 

5.  X  =  X  (idempotent  for  twice  complements) 

6.  X/<p  =  X  (idempotent  for  a  given  reference  image  R  =  <p) 

7.  X/X  =  ® 

S.  X  C  K  *»  F  C  X  (decreasing) 

9.  X  CY  ~  X/R  C  Y/R  (increasing) 

10.  X/R  C  X  (anti extensive) 

11.  X  CR~  X/R  =  <t> 

12.  XnT  =  X  u  7? 

13.  TU7F=  Xn  X 
14-  JfnX  =  ^ 

15.  XuX  =  W 

16.  X  ®  R  =  X©  R  where  R  =  {(-x, -y)  |  (x,y)  €  R} 

17.  X  ©  R  =  X©  R  where  R  =  {(-x,  -y)  |  (x,  y)  6  fZ} 

2.  Properties  of  Union  and  Intersection 

The  union  u,  a  binary  operation,  is  increasing,  extensive,  shift  variant,  idempotent,  commu¬ 
tative,  associative,  and  distributive  over  intersection.  The  intersection  D,  a  binary  operation, 
is  increasing,  anti  extensive,  shift  variant,  idempotent,  commutative,  associative,  and  distibutive 
over  union.  The  major  properties  of  the  union  and  the  intersection  are  listed  in  the  following: 

1.  X\J<t>  =  X 
X  n  <f>  — 

2.  X  U  X  =  X 
Xn  X  =  X 

3.  X  U  R  =  Ru  X  (commutative) 

X  n  R  =  R  n  X  (commutative) 

4.  X  U  (R  U  Q)  =  (X  U  R)  U  Q  (associative) 

Xn(AnQ)  =  (X  n  R)  n  Q  (associative) 

5.  XU  W  =  W 

X  nW  =  X  (idempotent  for  a  given  reference  image  R  =  W) 
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6. 

X 

u 

(R  n  Q)  =  (Jf  u  R)  n  ( X  u  Q)  (distributive) 

X 

n 

(R  U  Q)  —  (X  n  R)  u  (X  D  Q )  (distributive) 

7. 

X 

c 

Xu  R  (extensive) 

X 

n 

R  C  X  (antiextensive) 

3. 

X 

c 

Y  —  Xu  R  CY  u  R  (increasing) 

X 

c 

Y  —  X  n  R  cY  O  R  (increasing) 

9. 

X 

c 

R  -  Xu  R  -  R 

X 

c 

R- XnR  =  X 

10. 

R 

c 

X  A  Q  C  X  -  R  u  Q  C  X 

X 

c 

RaX  CQ  -  X  C  RuQ 

11. 

R 

c 

XaQcY-RuQcXuY 

R 

c 

X  aQ  cY - RnQ  cXnY 

3.  Properties  of  Dilation  and  Erosion 

The  dilation  ©,  a  binary  operation,  is  increasing,  extensive  for  a  given  reference  image  R 
which  contains  the  elementary  image  I,  shift  invariant,  commutative,  associative,  distributive 
over  union,  and  possesses  an  identity  which  is  I.  The  erosion  0,  a  binary  operation,  is  shift 
invariant,  increasing  (but  decreasing  with  respect  to  the  refemce  image  R),  antiextensive  for  a 
given  reference  image  R  which  contains  the  elementary  image  I.  But,  in  general,  the  erosion  is 
not  commutative,  not  associative,  not  distibutive  over  other  operations,  and  does  not  possesses 
a  left  identity.  The  major  properties  of  the  union  and  the  intersection  are  listed  in  the  following; 

1.  X  ®  R  ■=  RQ  X  (commutative) 

X  9  R  56  RQ  X  (in  general) 

2.  (X  ©  R)  0  Q  =  X  ©  {R  ©  Q)  (associative) 

(XQ  R)QQ  £  X  0  (A  9  Q)  (in  general) 

( XQR)QQ  =  (XeQ)QR 

3.  X  ®  (Ru  Q)  -  ( X  ©  R)  U  [X  ©  Q)  (distributive) 

Xe{RuQ)  =  {X  QR)n(X9Q) 

xq(rqq)  =  ( xeR)eQ 

4.  XQI  =  X  —  I®X  (identity) 

X©/  =  X#/©X(in  general) 

5.  X  ©d>  =  <t>  =  £©  X 

X  Q  <p  -  W  <£Q  X  (in  general) 

6.  X  C  X  ©  R  when  I  C  R  (extensive) 

X  ©  R  C  X  when  I C  R  (antiextensive) 

7.  XcT  —  X  R  CY  Q  R  (increasing) 

X  C  Y  ~  X  Q  R  CY  Q  R  (increasing) 
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8.  RCQ  ~X&RcXeQ 
RCQ ~XQQC XQR 

9.  X  ®  (R  n  Q)  c  (X  ®  R)  n  (X  ©  Q)  (distributive  inequality) 

XQ(RnQ)  D  (JfQ  A)U  (X©<5) 

(Xu  Y)QRD  ( XQR)r\(YQR ) 

C XQR)QQ  C  (X  ©£)©<? 

Remark:  “D”  means  “contains". 

4.  Properties  of  some  standard  operations 

1.  The  symmetric  difference  is  shift  variant  (with  a  fixed  reference  image  R),  commutative 
and  associative.  Symbolically, 

(a)  X  AR  =  RAX 

(b)  X  A{R  AQ)  =  (X  AR)  AQ 

(c)  X  A  <p  =  X 

(d)  XA  X  =  <(> 

(e)  XAT-W 

(f)  X  AW  —  X 

(g)  Xn(RAQ)  =  (XnR)A(XnQ) 

(h)  Xu(JSAg)#(XuR)A(XuQ)  (in  general) 

(i)  XAR  =  YAR-+X  =  Y 

2.  The  opening  o  is  shift  invariant,  increasing,  anti-extensive  and  idempotent.  The  closing  • 
is  shift  invarinat,  extensive,  and  idempotent.  Symbolically, 

(a)  XoRcXcX»R 

(b)  XCY -X0RCY0R 

(c)  Xc y-x.Rcy.R 

(d)  (X  0  J2)  0  R  =  X  0  R 

(e)  (X  •  R)  •  R  =  X  •  R 

3.  The  thinning  is  shift  invariant  and  antiextensive.  The  thickening  is  shift  invariant  and 
extensive.  The  major  properties  are  in  the  following: 

(a)  XQRcXcXQR 

(b)  X  CY  -*  XQRqYQR 

(c)  XC  Y  ^  XQRCYQR 

(d)  If  R  C  Q  (which  means  Ri  C  Q 1  and  R 2  C  Qj),  then  we  have: 

R  C  Q  -*  X  ®  R  <Z  X  ©Q  C  X  C  X  Q  Q  C  X  Q  R. 

(e)  XQR  =  X@Rm  where  R  =  {Ri.Ra}  and  R'  =  {Rj,Ri}. 
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Appendix  E 


Proof  of  Theorem  3.3:  We  can  easily  see  that  (2)  in  Therorem  3.3  is  a  generalization  of  (1) 
in  Theorem  3.3.  (1)  is  used  for  exactly  matching  shapes  (or  tempiates)  with  shift  invariance; 
(2)  is  generalized  to  more  general  cases.  For  example,  to  consider  noise  and  to  have  rotational 
invariance,  we  can  choose  the  family  {.£(0)}  to  incorporate  all  aspect  reference  image  pairs.  In 
the  following,  we  prove  (1)  and  then  (2)  will  follow  from  it  directly.  The  proof  will  demonstrate 
the  mathematical  correspondance  between  boolean  logic  and  BLA.  The  notations  x(i ,j)  and 
r(i,j)  will  be  used  to  represent  the  binary  values  (0  or  1)  of  pixels  at  coordinate  (i,j)  of  image 
functions  which  correspond  to  the  images  X  and  R  in  BIA  notations. 

First,  let  us  use  the  boolean  logic  XOR  (excusive  or)  operation,  i.e. 

z(i,j)  XOR  r(i,j)  =  (J(i,j)  A  r(i,j))  V  (x(i,j)  A  f(i,j)), 

to  achieve  the  pixel- wise  comparison  where  the  ouput  value  with  ‘0’  means  that  ‘z(:',j)’  matches 
‘r(i,  j)’  and  the  output  value  with  ‘1’  means  that  ‘x(t,  j)’  does  not  match  ‘r(i,  j)\ 

Second,  to  check  the  occurence  of  the  shape  (defined  by  R  with  M)  in  the  tested  image  X 
at  coordinate  (»,  j),  we  have  to  shift  the  origin  of  the  shape  to  the  coordinate  (i,  j)  in  X.  Then 
the  process  of  the  comparison  of  the  shape  and  the  subimage  in  X  (limited  in  the  mask  M)  and 
the  indication  of  “match”  (0)  and  “not  match”  (1)  will  be  performed  by 

V  (I(i  +  i,;'  +  /)Ar(*,i))V  \j  (x(i  +  k,  j  +  /)  A  T{kJ)). 

If  the  above  equation  is  considered  as  a  binary  operation  operating  on  two  images  x{i,j)  and 
r(»,  j),  then  this  operation  is  not  commutative;  in  order  to  achieve  the  commutativity,  we  change 
(k, l)  with  (- k,-l )  and  denote  r(k,l)  -  r(-k,-l ): 

V  (J{i-k,j-l)*f(k,l))v  V  (*(i-k,j-l)AHk,l)). 

(-*,-<)€#  (-*,-<)€,< r 

If  the  output  value  of  the  above  equation  is  ‘O’,  then  it  means  that  the  location  (»,  j)  of  the 
image  X  has  the  occurrence  of  the  shape  (  defined  by  R  and  M);  if  T’,  the  shape  is  not  occurred 
**(*-»• 

Third,  let  us  run  over  all  coordinates  (i,j)  (i.e.  for  ail  (»,  j)  6  W  the  universal  image)  and 
then  the  union  of  those  coordinates  with  value  ‘0’  would  be  the  answer.  The  value  ‘0’  at  a 
coordinate  (i,j)  corresponds  to  the  null  image  in  set  notation  and  the  value  ‘1’  at  a  coordinate 
(t,  j)  corresponds  to  the  point  image  {(i,j)}.  For  convenience,  in  the  following  we  mix  the 
notations  of  boolean  logic  functions  and  set  notations;  if  the  output  of  a  boolean  logic  expression 
is  ‘O’,  it  represents  the  null  image  <fi;  if ‘1’,  it  represents  the  point  image  {(i,;)}.  Thus,  we  have 

U  (  V  V  (i(t-k,j-0Af(k,i))) 

(«.J)€W  /)€A5f  (-*,-l)€Ai 

which  is  the  same  as 

U  (  V  AKM)))U  U  (  V  (x(i-k,;-0  A  *(*,/))). 
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Since  x(i,j )  /  0  only  when  (i,j)  6  X  and  f(k,l)  56  Q  only  when  ( k,l )  6  R ,  we  have 

U  (  V  k,j  -l)*f (k,l)))  =  {(i,i)|(i-i,j-!)6l,(i,!)si} 

(w)€W  (-fc,-oeA5r 

=  {(<  +  *,; +  0 1  (*.i)  €*,(*,/)€  A} 

=  X  © 


Similarly,  we  have 


U  (  V  (x(i-k,j-l)Ar(k,l))-Xe(M/R). 

Me w  (-*,-oevr 


Hence,  if  we  use  'O’  to  indicate  ■‘match” ,  we  have 

(X®  R)u(X  ®(M/R))-, 
if  we  use  T  to  indicate  “match", then  we  have 

(X©ik)U(X  S(M/R)). 

Thus,  the  locations  of  a  shape,  which  is  defined  by  a  non-null  reference  image  R  with  a  non- 
null  reference  image  (called  mask)  Af  and  5  C  M  C  W,  are  the  image  points  in  the  following 

(X®R)u(X®(M/R))  =  (X  ®  A)  u  (X  ©  M  u  R)  =  (X  0  A)  n  (X©  (M/iZ)). 

A  more  intuitive  illustration  is  that  the  foreground  X  should  match  R  by  X  Q  R  (using 
multiple-input  AND  gates  to  examine  the  locations  where  the  l’s  should  be),  while  the  back¬ 
ground  X  should  match  M/R  by  Xq(M/ .ft).  Combining  both  results  by  the  intersection  (AND), 
we  then  implement  the  shape  recognition  by  (X  0  ft)  n  (X  0  (M/R)).  Replacing  R  by  ftj  and 
(M/R)  by  R-i,  we  obtain  the  hit  or  miss  transform  (template  matching)  for  shape  recognition. 


Appendix  F 

Proof  of  Theonm  3.4'.  , 

(1)  The  straight  forward  way  for  removing  the  “pepper”  noise  is  the  thinning  operation 
X  @R*  (or  X  (3)ft«j.  Follow  this,  we  have 

*©ft4  =  Yu(Tef)u(X6  M4) 

=  XuXu(X©  Af4) 

=  Xu  (XnXeM4)  . 

=  (XuX)n(tu TOT 
=  W  n  (X  u  X  0  Af4j 
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(2)  The  straight  forward  way  for  removing  the  “pepper”  aoise  is  the  thickening  operation 


X  0<?4  (or  XQ  Qs).  Follow  this,  we  have 


XO<?4  =  Xu(Y0.Vf4)u(X0/) 

x  u  (Ye  iU)  u  x 
X  u  (Y  ®  MA  n  Y) 

(. X  u  Y  0  Af<)  n  (X  u  X) 
(X  u  X  0  .kf4)  n  kV 


=  (X  u  X  ®  Ma). 

(3)The  straight  forward  way  for  removing  the  “salt  and  pepper”  noise  the  difference  of 
X  QQ*  by  X  ®  (or  the  difference  of  XQ3s  by  X  ®  R%).  By  a  similar  procedure  as  above 
we  can  achieve  the  desired  result. 

Appendix  G 

Proof  of  Theorem  3.5:  To  extract  the  region  whose  sizes  are  between  two  reference  images 
R  and  Q,  the  straight  forward  way  is  to  design  a  morphological  band  pass  filter  (Theorem  3.2): 

(X  o  R)/(X  o  Q)  =  ((X  0  R)  8  R)/((X  8  3)0  Q). 

To  obtain  the  locations  of  those  desired  regions,  we  then  perform  the  skelotonization: 


S(((X e  R) 8  H)/((X 0  3)0 3))  =  S((X  0  *)/((X e  3)  ®  3)))  *  S((Y 0  R)  u  (Y©  3  0  3))- 
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figure  2(a):  A  ceUuJar  string.  It  requires  onJv 
a  1-D  interconnection  mechanism.  Each  ceil 
connects  with  its  two  nearest  ceils. 
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Figure  1:  A  sequential  process  of  cellular  logic 
operations  (CLO).  The  value  X'{i,j)  is  deter¬ 
mined  by  the  corresponding  X(i,.7)in  the  orig¬ 
inal  image  along  with  the  values  of  its  neigh- 


Figure  2(b):  A  cellular  array.  It  requires  a  2-D 
interconnection  mechanism.  Each  cell  connects 
with  its  4  or  3  nearest  cells. 


Figure  2(c):  A  one-dimensional  cellular  hypercube  [24].  Each  cell  connects  with  cells  at  dis¬ 
tances  1,  2, 4, 8, ....  2fc  from  it.  Here,  only  the  connections  with  distances  1,  2,  and  4  are  shown. 


Figure  2(d):  A  two-dimensional  cellular  pyramid.  It  consists  of  stages  of  arrays  with  connections 
between  two  adjacent  stages  and  requires  a  3-D  interconnection  mechanism. 
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Figure  3:  The  universal  image  W.  It  has  (2n  + 
1)  x  (2 n  +  1)  image  points  and  n  is  a  positive 
integer. 
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Figure  4:  An  example  of  fundamental  opera¬ 
tions:  complement  union  U,  and  dilution  0. 


:  the  pixel  at  coordinate  (x,y) 


:  the  pixel  at  coordinate  (x,y) 


(a)  the  4-neighborhood  of  (x,y). 


(b)  the  8-neighborhood  of  (x,y). 


Figure  5:  The  4-neighborhood  and  8-neighborhood  of  an  image  point  (z,y). 
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I®  X  (a)  A  4-connected  (8-connected  (b‘ An  8-connected  component  of  X. 

too)  component  of  X. 

Figure  6:  The  4-connected  component  and  8-connected  component  of  an  image. 


Image  X 


(a)  The  outside  of  X. 


(b)  The  holes  of  X. 


Figure  7:  The  outside  and  holes  of  an  image. 
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Figure  8(a):  Difference. 
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Figure  8(b):  Intersection. 
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Figure  8(c):  Erosion. 
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Table  1(a):  Basic  properties  of  the  three  fundamental  operations  and  of  three  derived  operations 
(alternative  fundamental  operations). 
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Table  1(b):  Basic  properties  of  some  standard  derived  operations. 


Figure  9(a):  One  kind  of  morphological  low  pass  filter  (opening) 


X  R 

Figure  9(b):  A  second  kind  of  morphological  low  pass  filter  (closing). 


Figure  9(c):  A  morphological  high  pass  filter. 


Figure  10(a):  One  kind  of  shape  recognition.  R  represents  the  shape  to  be  identified,  and  must 
lie  entirely  and  exclusively  in  the  mask  defined  by  M. 


Figure  10<b):  Hit  or  miss  transform,  which  recognizes  locations  of  foreground  points  given  by 
Ri  in  conjunction  with  background  points  given  by  Rj- 
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Figure  14(a):  An  example  of  the  convex  hull  of  an 
of  projections  ). 
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Figure  14(b):  The  difference  of  C(X)  by  X. 


—  Connections  in  the  DOCIP-array4 
=}  Connections  in  the  DOCIP-array8 


Optical  Feedback  Path 


Interconnection  Unit 


N  x  N  Output  Side  of  Array  of  Cells 
(implemented  by  optical  gate  array) 


N  x  N  Input  Side  of  Array  of  Cells 
(implemented  by  optical  gate  array) 


Figure  15:  An  optical  4-connected  or  8-connected  cellular  array  (DOCIP-array4  or  DOCIP- 
arrayS).  Imaging  optics  are  omitted  for  clarity.  Each  cell  connects  with  its  four  nearest  cells 
and  itself  by  optical  3-D  free  interconnection.  The  input  and  output  sides  of  the  optical  gate 
array  are  interconnected  by  am  optical  feedback  path  and  are  shown  separately  for  clarity. 


Figure  16:  An  optical  4-directed  or  8-directed  cellular  hypercube  (DOClP-hypercube4  or  DOCIP- 
hypercube8).  Each  cell  connects  with  cells  in  the  4  directions  or  8  directions  at  distances 
1,2, 4, 8,..., 2*  from  it  by  optical  3-D  free  interconnection. 
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Figure  17:  A  conventional  hypercube  (4-cube)  laid  out  in  two  dimensional  space.  Its  inter¬ 
connections  have  no  spatial  invariance. 


-  Connections  in  the  4-directed  cellular  hypercube 

=}  Connections  in  the  8-directed  cellular  hypercube 
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Figure  18:  A  two-dimensional  cellular  hypercube  —  DOCIP-hypercube.  Each  cell  is  inter¬ 
connected  with  other  cells  having  a  relative  one  bit  difference  in  coordinate  label  in  positive 
or  negative  z  and  y  directions  to  achieve  a  spatially  symmetric  and  invariant  interconnection 
pattern.  Only  connections  from  the  central  cell  are  shown;  all  cells  are  connected  identically  so 
the  resulting  interconnections  are  space  invariant. 
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Figure  19:  A  digital  optical  cellular  image  processor  (DOCIP)  architecture  —  one  implementa 
tion  of  binary  image  algebra  (BLA).  The  DOCIP-array  requires  9  (or  5)  control  bits  for  reference 
image  £,.  The  DOCIP-hypercube  requires  O(logN)  control  bits  for  reference  image  £,. 
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Table  2:  Cellular  image  processor  execution  times  for  N  x  JV  image  data.  It  roughly  com¬ 
pares  the  execution  time  for  the  conventional  electronic  array  processor,  the  DOCIP-array  and 
the  DOCIP-hypercube. 
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Figure  20:  The  input  image  X. 


Reference  Image  R  Reference  Image  Q 

Figure  21:  The  reference  images  R  and  Q. 


Figure  22:  The  expected  output  image  V. 
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Figure  23:  An  allowed  reference  image  E  at  a  clock  cycle  in  the  DOCIP-arrayS  (.also  allowed  in 
DOCIP-hypercubeS)  and  its  corresponding  9  (or  33)  bits  in  instruction  (nini...r>;.)  for  control 
ling  the  neighborhood  mask  (i.e.  the  reference  image  for  the  dilation). 
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Figure  24:  An  allowed  reference  image  P  at  a  clock  cycle  in  the  DOCIP-hypercubeS  (not  al¬ 
lowed  in  the  DOCIP-arrayS)  and  its  corresponding  33  bits  (assume  31  x  31  cells)  in  instruction 
(n!n2...n33)  for  controlling  the  neighborhood  mask  (i.e.  the  reference  image  for  the  dilation). 


Figure  25:  The  locations  of  the  desired  objects  in  the  output  image  V 
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Summary 

Binary  image  algebra  (BIA),  a  unified  systematic  complete  theory  of  parallel  binary  image  processing  [1],  also  provides  a 
unified  spatial  logic  of  digital  optical  computing  for  describing  symbolic  substitution,  cellular  logic  and  Boolean  logic  in  parallel 
Symbolic  substitution  has  been  used  to  implement  logic,  arithmetic,  communication  and  simulating  a  Turing  machine  [3];  but  its 
implementation  of  some  operations  (e.g.  parallel  binary  arithmetic)  is  relatively  complicated  to  other  BIA  implementations  [3],  In 
this  paper  we  further  suggest  some  BIA  algebraic  techniques  and  pattern  recognition  algorithms,  including  a  shift,  scale  and  rotation 
invariant  algorithm,  to  improve  the  speed,  flexibility  and  complexity  of  symbolic  substitution. 

A  symbolic  substitution  rule  involves  two  steps:  1)  recognizing  the  locations  of  a  certain  spatial  search-pattern  within  the  2-D 
input  data,  and  2)  substituting  a  new  replacement- pattern  wherever  the  search-pattern  is  recognized.  As  illustrated  in  Fig.  1,  BIA 
can  be  used  to  realize  a  symbolic  substitution  rule  defined  by: 

(.Y®ft©Q  =  ((A*©ft)n(Xeft))©Q  =  (X©ft)u(.Y©  ft)©Q  ,1, 

where  .Y  is  the  2-D  input  data.  R  =  (ft ,  ft)  is  the  reference  image  pair  corresponding  to  the  search-pattern  (ft  and  ft  define  the 
foreground  and  the  background  of  the  search-pattern  respectively),  A  defines  a  reflected  reference  image  given  bv  R  =  {(-r.  -y)  I 
(r,y)  6  ft,  Q  is  the  reference  image  corresponding  to  the  replacement-pattern,  “  ®”  denotes  the  hit  or  miss  transform  which  is 
the  pattern  recognizer,  “©’  denotes  the  erosion  operation,  and  “©”  denotes  the  dilation  operation  which  is  the  pattern  replacement 
operator.  To  work  with  more  than  one  rule  (say  p  substitution  rules)  for  practical  applications,  a  symbolic  substitution  system  (Fig. 
2)  produces  several  copies  of  the  input  X ,  provides  p  different  recognizer-substituter  units,  and  then  combines  the  outputs  of  various 
units  to  form  a  new  output.  Thus,  a  symbolic  substitution  system  is  implemented  by 

U(X  ®tf*>)  BQ(,)  (2) 

*»1 

where  ft0  and  Q{'\  i  =  1,2 . p,  are  the  reference  image  pairs  and  replacement  patterns  in  the  i"'  symbolic  substitution  rule. 

This,  then,  is  the  BIa  formula  for  general  symbolic  substitution. 

However,  in  many  cases  the  above  form  is  inefficient  and  can  be  reduced  to  a  relatively  simpler  form  or  implemented  in  a  more 
efficient  way  by  using  some  BIA  algebraic  techniques.  Here  are  some  examples:  1)  the  fall  recognition  can  be  implemented  by  only 
the  background  or  foreground  recognition  under  certain  conditions;  2)  if  Q(,)  =  4>,  the  i1'*  symbolic  substitution  rule  in  Eq.  (2)  is 
not  needed  (e.g.  the  four  rules  of  binary  subtraction  in  simple  intensity  coding  of  arithmetic  data  can  be  reduced  to  only  two  rules 
[2]);  and  3)  if  =  Q  for  all  1  <  i  <  p  (this  happens  in  those  cases  that  a  class  of  search-patterns  is  defined  by  a  set  of  reference 
image  pairs  ft’\  1  =  1,2, ...,  p),  we  should  combine  the  results  of  the  hit  or  miss  transforms  first  and  then  replace  them  by  the  same 
replacement-pattern  Q  instead  of  implementing  p  substitution  units  for  realizing  the  same  substitution  step,  i.e. 

r 

((JA®A(,))©<3.  (3) 

1*1 

The  practical  difficulty  with  the  implementation  in  Eqs.  (2)  and  (3)  is  that  the  hit  or  miss  transform  is  only  efficient  for  the 
shift  invariant  recognition  and  would  require  a  large  number  of  intricate  reference  image  pairs  to  perform  the  recognition  step  in  the 
presence  of  changes  in  scale,  rotation  or  both.  Thus,  it  might  be  too  costly  to  implement  scale  and  rotation  invariant  recognition  of 
intricate  patterns  for  symbolic  substitution  based  on  the  above  formula.  For  example,  if  we  want  to  substitute  all  “square  patterns'* 
in  an  input  image  by  the  same  character  “S",  it  would  be  very  inefficient  to  use  the  above  symbolic  substitution  implementation 
techniques. 

To  solve  this  kind  of  scale  and  rotation  invariant  problem,  here  we  recognize  all  the  desired  patterns  by  reversing  the  growing 
procedure  of  a  family  of  patterns.  This  family  defines  all  patterns  in  the  presence  of  changes  in  scale,  rotation  or  both,  and  transforms 
all  the  desired  patterns  into  their  original  seeds,  which  are  isolated  single  image  points.  We  have  developed  a  description  of  this 
procedure  in  terms  of  BIA.  For  brevity,  here  we  describe  only  the  case  of  shift  and  scale  invariant  recognition.  Suppose  we  want  to 
recognize  all  square  patterns  with  different  scales  and  locations  in  the  input  image  X  (e.g.  Fig.  3(a))  and  to  produce  the  output 
image  Y  (e.g.  Fig.  3(b)).  The  procedure  is:  1)  determine  a  growing  sequence  of  the  desired  patterns  T,  (e.g.  Fig.  3(c)),  where 
0  <  i  <  m  and  the  largest  size  of  the  desired  patterns  iimxm;  2)  find  a  small  set  of  good  reference  image  pairs  (ft#)}  (e.g.  Fig.  3(d) 
has  only  5  small  reference  image  pairs  for  recognizing  all  square  objects  with  different  scales)  satisfying  some  criteria,  where  each 
reference  image  pair  in  (A(0)}  corresponds  to  a  possible  neighborhood  of  a  given  foreground  image  point  in  a  pattern  T,,  1  <  i  <  m, 
whose  previous  state  in  the  pattern  T.-i  is  a  background  point;  3)  transform  the  desired  patterns  T,.  i  =  1,2, ....  m,  in  the  2-D  input 
image  .Y  =  -Y(to)  into  their  original  seeds  (i.e.  To  which  contains  one  and  only  one  foreground  image  point)  by  the  recursive  relation 
,Y((k+i)  =  -Y(ii.)/IJ#€a  «Y(t*)  ®  R(8),  where  0  <  k  <  m;  and  4)  pick  up  the  original  seeds  by  Y  =  .Y(tm)  ®Q,  where  Q  (Fig.  3(e)) 


113 


i3  a  reference  image  parr  with  one  and  only  one  foreground  image  point  at  the  center  and  Y  is  the  final  recognition  output  3v 
selecting  good  reference  image  pairs  associated  the  growing  sequences  of  rotation  patterns,  we  can  extend  shift  and  scale  invariance 
to  include  rotation  invariance  in  a  similar  way.  This  algorithm  can  efficiently  reduce  the  computation  complexity  for  a  certain  class 
of  pattern  recognition  and  symbolic  substitution  problems;  their  computation  times  depend  only  on  the  diameter  of  the  largest 
desired  pattern,  but  not  on  the  number  of  patterns  nor  the  size  of  the  whole  image. 

A  digital  optical  cellular  Image  processor  (DOCIP)  [l]  [2]  Implements  all  the  above  algorithms  of  symbolic  substitution  and 
pattern  recognition  in  a  flexible  and  efficient  way  compared  to  a  symbolic  substitution  processor  (Fig.  2)  with  p  fixed  recognizer- 
substituter  units.  The  DOCIP  programming  for  these  algorithms  will  be  illustrated. 
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Figure  1  BIA  representation  of  symbolic  substitution.  The 
optional  mask  .Vf  is  for  controlling  the  block  search  region. 


Figure  2.  A  symbolic  substitution  system  with  p  symbolic 
substitution  rules. 
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(c)  The  growing  sequence  of  square  patterns  T,,  o  <  i  <  4. 
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(d)  A  set  of  good  reference  image  pairs  {/?(#)}  for  square 
patterns  with  different  scales. 


1:  foreground  points  with  value  1 
b:  background  points  with  vaiua  0 

(e)  The  reference  image  pair  Q. 


Figure  3.  A  shift  and  scale  invariant  pattern  recognition  of 
sqnare  patterns. 
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2.2  Digital  Optical  Cellular  Architectures 


The  papers  reprinted  in  this  section  discuss  details  of  optical  cellular  architectures  and  their  in¬ 
struction  set. 

The  DOCIP  is  a  2-D,  page  oriented  array  of  individual  processors  located  at  every  pixel  of  a 
large  image.  The  attached  paper  by  K.S.  Huang,  B.K.  Jenkins  and  A. A.  Sawchuk,  “Binary  Image 
Algebra  and  Optical  Cellular  Logic  Processor  Design”,  submitted  to  Computer  Vision,  Graphics 
and  Image  Processing ,  summarizes  some  of  these  concepts  and  their  algebraic  background.  Fol¬ 
lowing  this  paper  is  “Optical  Symbolic  Substitution  and  Pattern  Recognition  Algorithms  Based 
on  Binary  Image  Algebra”,  by  K.S.  Huang,  B.K.  Jenkins  and  A. A.  Sawchuk,  from  the  ICO  Topi¬ 
cal  Meeting  on  Optical  Computing,  Toulon,  France,  1988,  which  contains  additional  information. 

This  paper  is  concerned  with  the  hardware  implementation  of  one  cell  of  a  prototype  digital 
optical  cellular  image  processor  (DOCIP). 
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Abstract 

A  processing  element  of  a  prototype  digital  optical  cellular  image  processor  (DOCIP)  is  implemented  to 
demonstrate  a  particular  parallel  computing  and  interconnection  architecture. 


Submitted  to  ICO  Topical  Meeting  On  Optical  Computing,  Toulon,  France,  August  29  -  September  2,  1988. 
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^  Summary 

I 

Digital  optical  cellular  image  processor  (DOCIP)  architectures,  DOCIP-array  and  DOCIP-hypercube,  can  perform 
the  tasks  of  parallel  binary  image  processing  and  parallel  binary  arithmetic  [1].  The  use  of  optical  interconnections  per¬ 
mits  a  cellular  hypercube  topology  to  be  implemented  without  paying  a  large  penalty  in  chip  area  (the  cellular  hypercube 
interconnections  are  space-invariant  which  implies  relatively  low  hologram  complexity);  it  also  enables  images  to  be  input 
to  and  output  from  the  machine  in  parallel.  Table  1  gives  a  comparison  of  three  different  interconnection  networks:  cel¬ 
lular  array  (DOCIP-array  interconnection  network),  conventional  hypercube,  and  cellular  hypercube  (DOCIP-hypercube 
interconnection  network).  In  this  paper  we  experimentally  demonstrate  the  concept  of  the  DOCIP  architecture  by  imple¬ 
menting  one  processing  element  of  a  prototype  optical  computer  including  a  49-gate  processor,  an  instruction  decoder, 
and  electronic  input/output  interfaces. 

A  multiple-exposure  multi-facet  interconnection  hologram  provides  the  fixed  interconnections  between  the  outputs 
and  the  inputs  of  an  array  of  7  x  7  optical  gates.  The  input  data  and  the  instructions  are  supplied  from  an  LED  array. 
The  outputs  of  optical  gates  are  detected  by  a  video  camera  and  compared  with  the  results  of  a  software  simulation.  A 
diagram  of  the  main  components  of  this  experimental  system  is  shown  in  Fig.  1. 

A  space-variant  interconnection  system  [2]  for  wi t hin- processor  interconnection  is  used  in  this  experimental  demon¬ 
stration.  A  computer  controlled  system  is  used  to  make  an  array  of  49  interconnection  subholograms.  An  optical  point 
source  S,  whose  position  is  controlled  by  the  mirror  M2  with  two  rotational  stages  (Fig.  1),  is  used  to  provide  an  object 
beam  for  determining  an  interconnection  of  a  subhologram  in  the  multi-facet  hologram.  A  mask  with  a  circular  apertnre, 
controlled  by  two  translational  stages,  is  used  to  determine  the  sizes  and  positions  of  subholograms  in  a  holographic  plate. 
The  interconnection  hologram  for  this  49-gate  optical  processing  element  comprises  49  subholograms,  which  are  laid  out 
in  a  7  x  7  array.  Each  subhologram  covers  a  circular  area  with  a  diameter  of  l.S  mm.  The  spacing  between  the  centers 
of  two  subholograms  is  3.0  mm.  Note  that  the  path  of  the  object  beam  and  the  mask  for  subholograms  are  only  used  for 
making  the  interconnection  hologram;  they  are  blocked  or  moved  when  we  reconstruct  the  hologram  to  implement  the 
interconnections  of  the  optical  gates.  We  use  a  volume  phase  hologram  with  a  dichromated  gelatin  medium  for  obtaining 
high  diffraction  efficiencies. 

The  array  of  7  x  7  optical  gates  is  implemented  by  a  Hughes  liquid-crystal  light  valve  (LCLV)  with  liquid-CTystal 
'  molecules  in  a  45*  twisted  nematic  configuration  [2].  The  LCLV  is  read  out  between  crossed  polarizers  and  is  biased  to 

implement  a  NOR  operation.  The  gate  size  in  this  experiment  has  a  diameter  of  0.3  mm  and  the  spacing  between  the 
centers  of  two  gates  is  0.6  mm. 

The  circuit  diagram  of  the  processing  element,  as  shown  in  Fig.  2,  consists  of  49  NOR  gates  with  maximum  fan-in 
of  3  and  fan-out  of  4.  The  processing  element  includes  a  3-bit  destination  selector,  a  3-bit  master-slave  flip-flop  memory, 
a  6-bit  memory  selector  with  a  union  module,  and  a  5- bit  neighborhood  selector  (for  DOCIP-array4  [1])  with  a  dilation 
module.  This  experimental  DOCEP  system  has  one  instruction,  supplied  from  an  LED  array  and  decoded  by  the  optical 
hardware  This  instruction  has  the  format:  (c,  di, dj,  ds.S; , sj,...,s«,  »i, n2,  ...,ns)  where  c  selects  the  image  from  the 
input  or  from  the  feedback;  di.dj,  and  d]  select  the  destination  memory  for  storing  the  image;  st.sj, ...,ss  select  the 
output  from  the  memory  elements;  and  nj ,  nj,  ...,»$  control  the  neighborhood  m:sk,  i.e.  supply  the  reference  image.  We 
will  experimentally  demonstrate  the  DOCIP  architecture  concept  with  this  system. 
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Table  1.  A  comparison  between  three  different 
interconnection  networks  of  NxN  processeng 
elements  (PEs):  cellular  array,  conventional 
hypercube  and  cellular  hypercube.  When  laid  out 
on  a  VLSI  chip,  both  the  conventional  hypercube 
and  cellular  hypercube  pay  a  large  penalty  in  chip 
area  while  the  cellular  hypercube  has  a  relatively 
low  hologram  complexity. 


Figure  1.  Experimental  DOCIP  system.  Lens  LI  images 
from  the  LCLV  gate  output  plane  to  the  hologram  plane. 
Beam  Splitter  BS3  combines  the  external  input  signals 
from  LEO  array  and  the  feedback  signals  from  intercon¬ 
nection  hologram.  LP1  and  LP2  are  lens-pinhole 
assemblies.  PI  and  P2  are  crossed  polarizers.  The  holo 
gram  comprises  an  array  of  subholograms.  Mirror  M2 
conlrols  the  position  of  point  source  S  during  hologram 
exposure.  After  the  hologram  is  made,  the  mask  and  ail 
components  in  the  path  from  BS1  to  the  hologram  are 
not  needed. 


Figure  2.  The  circuit  diagram  of  a  49-gate  processing  element  of  the  DOCIP-array4. 
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